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Preface 


The present volume of the Mathematical Reference Library 
is devoted to the so-called method of statistical trials (the Monte 
Carlo method). In contrast with the earlier volumes, which were 
devoted to the classical divisions of mathematics and a sharply 
delineated subject matter with well-established terminology and 
traditions of exposition, the mathematical methods examined in 
the present volume have been developed only in the last thirteen 
years.* 

These methods, which are applied in the most varied fields 
of computational mathematics, are unified by a single common 
idea. 

They are based on the principle of simulating a statistical 
experiment by computational techniques and recording the numeri- 
cal characteristics obtained from this experiment. Therefore, 
all these methods are united under thecommonname of statistical 
trials or the Monte Carlo method, The solution of numerical 
problems by this method is closer in spirit to physical ex- 
periments than to classical computational methods. Error in 
the Monte Carlo method cannot be sufficiently well evaluated 
in advance and, as a rule, is found by determining the mean 
squares for the simulated quantities. In a number of cases the 
solution cannot be accurately reproduced. The solution is stable 
with respect to single errors in operation of the given electronic 
computer. 

It is the purpose of thepresent volume to show the fundamental 
distinctive features of the \Wionte Carlo method, giving a suf- 
ficiently thorough discussion of the facilities and typical procedures 
employed and the principal regions of application. 

In Chapter I (by Yu. A. Shreider) the distinctive features 
of the method are expounded and typical examples of its applica- 
tion in problems of purely mathematical computation are ex- 
amined. 


*Essentially, application of the Monte Carlo method dates back to the work of 
N. Metropolis and S, Ulam [203], which appeared in 1949, The term ‘‘Monte Carlo’’ 
appeared first in this work, 
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In Chapter II (by I. M. Sobol’) the principal region of applica- 
tion of the method—the evaluation of multidimensional integrals— 
is examined in great detail. In it, many methods of statistical 
Simulation of integrals are analyzed and the accuracy of the 
calculations is investigated. 

Chapter III (by I. M. Sobol’) and Chapter V (by V. G. Sragovich) 
are devoted to applications of the Monte Carlo method in those 
regions of physics and technology in which it is most widely and 
successfully used. (The method was first applied in the field of 
neutron physics.) These chapters discuss random processes (the 
passage of a beam of neutrons through a scattering medium, 
fluctuations in a radio signal, etc.) that are so complicated that 
analytic description of them is practically impossible. However, 
successful investigation of such processes can be achieved by 
their statistical simulation in digital computers. These processes 
have special peculiarities and in their simulation many special 
methods have been devised which it was found convenient to put 
into separate chapters. 

Chapter IV (by N. P. Buslenko) discusses application of the 
Monte Carlo method to investigation of mass-service processes 
(including queueing). This field, which has been developed only 
recently, is related to simulation of the most complicated sys- 
tems of equations and to operations research. It holds great 
promise for use in mathematical cybernetics. 

In Chapters VI (by D. I. Golenko) and VII (Sections 1, 2, 3, 
and 5 by N. P. Buslenko, Section 6 by V. G. Sragovich, and Sec- 
tion 4 by N. P. Buslenko and V. G. Sragovich together) the methods 
of organizing statistical experiments for general-purpose digital 
computers are examined. Important questions on every applica- 
tion of the method of statistical trials are examined in these 
chapters. 

Among the important fields of application of the Monte Carlo 
method, two remain unelucidated: self-organizing systems and 
reliability of complicated radio-electronic assemblies (e.g., com- 
puters). However, each of these fields is still in its infancy 
and the inclusion of appropriate material in a reference work 
such as this would be problematical. Questions relating to the 
first of these fields are partially explained in the work by R. 
Bush and F, Mosteller, Stochastic Models for Learning (John 
Wiley and Sons, 1955). 

Since the bibliography for the separate chapters coincides 
in a number of cases and also for reasons of convenience, a 
single bibliography on the Monte Carlo method appears at the 
end of the book (compiled by I. M. Sobol’). All citations in the 
text refer to this bibliography. 
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General problems of the Monte Carlo method have been ex- 
plained in a recent book by N. P. Buslenko and Yu. A. Schreider 
{11]. The present volume differs from it in its more elementary 
and detailed exposition of the material and in its references to 
a wider class of applications. 

The text does not discuss structural aspects of digital com- 
puters designed for solving problems by the Monte Carlo method 
and, in particular, questions concerning the design of special- 
purpose digital computers, 

A large group of authors participated in writing this volume. 
Although the work was done on a chapter basis, some unavoidable 
overlap has remained in the content, and there is also some 
variation in the style of exposition. 

The work is intended for awidecircle of readers varying from 
persons seeking familiarity with the fundamentals of application 
of the method to persons who are interested in comparatively 
detailed aspects of the simulation of physical processes. With 
the exception of Chapters IIland V, which are designed for readers 
familiar with certain general information on neutron physics and 
radio engineering, understanding of the greater part of the 
material presented in this book requires of the reader only 
mathematical preparation in a university course in mathematical 
analysis and also a knowledge of the fundamentals of probability 
theory, since probability concepts run like a fine thread through- 
out the entire book. It is assumed that the reader is familiar 
with the fundamentals of random events and random variables 
and with their probability characteristics (the probability of an 
occurrence, mathematical expectation, variance). Furthermore, 
he must have a grasp of the concept of normal distribution, 
Lyapunov’s theorem and, for certain chapters, of the elements 
of mathematical statistics. An understanding of Markov processes 
would also be very desirable. 

The authors hope that this reference book will prove useful 
to people who deal with the Monte Carlo method and its applica- 
tion and, what is no less important, will give many readers an 
understanding of the usefulness of the Monte Carlo method in 
the solution of various problems. The authors express their 
gratitude to V. D. Rozenknop for his valuable advice during prep- 
aration of this work. 


Yu. Shreider 


Chapter | 
Fundamentals of the Monte Carlo Method 


1. DEFINITION AND SIMPLE EXAMPLES OF THE 
APPLICATION OF THE MONTE CARLO METHOD 


The Monte Carlo method is a method of solving various prob- 
lems in computational mathematics by constructing for each 
problem a random process with parameters equal to the re- 
quired quantities of that problem, The unknowns are determined 
approximately by carrying out observations on the random process, 
and by computing its statistical characteristics which are approxi- 
mately equal to the required parameters, 

For example, the required quantity x may bethe mathematical 
expectation Mé of a random quantity. The Monte Carlo method for 
the approximate determination of x will then involve the N-fold 
sampling of t, yielding a series of independent valuest,, &, ..., Ey> 
and the calculation of the mean 


Erb eeb vee bby 
te. 


i= 


In accordance with the law of large numbers, when N is sufficiently 
large the mean is given by 


Ex ME=x, 


which holds with probability approaching unity. It follows that the 
quantity —, which is deduced from observations on the random 
process, is approximately equal to the required quantity x. 

Consider a simple example. Suppose it is necessaryto calculate 
the probability w that the total number of ‘‘hits’’ scored with an 
arrow shot at a target is evenafter ten attempts. If the probability 
of scoring a hit as a result of a single attempt is p, then the re- 
quired probability w can be calculated from the formula 


v= x ree ad 6 ag, (1.1) 
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This formula is clearly correct, since the general term following 
the summation sign is the probability that the number of hits is 
exactly equal to 2k. The calculation of the probability w using 
equation(1.1) and tabulated values of Ci} will involve 18 multipli- 
cations and 6 additions. 

Alternatively, one could carry out N series of 10 shots each 
and determine the number of cases, Z,wherea series of ten shots 
resulted in an even number of hits. When N is sufficiently large, 
the quotient L/N will be a good approximation to the required 
probabilityw. It will be shown below that in order to obtain an esti- 
mate of w which will be correct to two decimal places, it is necessary 
to carry out 10,000 series of trials each involving 10 shots, 

In the example just quoted, it is simpler to calculate the 
probability from equation (1.1) than to make 100,000 shots. 

This example is an illustration of how an unknown quantity may 
be determined from a real experiment. More precisely, the Monte 
Carlo method is usually understood to involve the construction of 
an artificial random process exhibiting all the required properties, 
but realized by ordinary computation means, ée.g., pencil, paper, 
numerical tables, calculating machines, and, occasionally, simple 
means of generating random quantities (the so-called random- 
number generators; cf, Chapter VI). 

In practice, the Monte Carlo method has found wide application 
only in the course of the exploitation oflarge computers. We shall 
see later that the method is particularly convenient with digital 
computers, As a rule, the Monte Carlo methodis used in general- 
purpose computers, although it is frequently feasible to construct 
special-purpose machines for particular classes of problems, 

We shall now consider how the above probability w can be 
determined with the aid of simple computational means. Instead of 
the arrow we Shall set up a process which will involve the use of 
a spinning top and a clock with a second hand, and also pencil and 


paper. We shall assume for simplicity that p=s. Instead of a 


series of 10 shots we shall spinthetop 10 times, and when it falls 
on the floor we shall look at the clock and read off the position of 
the second hand. If the second hand indicates a value x in the range 
0<t< 12 then we shall score a “‘hit.’’? Again, if in a series of 
10 spins we observe an even number of hits, then the series will 
be regarded as ‘‘successful.’’ Suppose that N series of such 
experiments, each involving 10 spins, result in Z successes, The 
random quantity L/N will then be distributed in precisely the 
same way as the analogous quantity obtained above with a real 
arrow, and will therefore be approximately equal tothe required 
probability w. 
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The experiment involving the spinning top is more economical 
but is no less time-consuming than the ‘‘real’? experiment, In 
order to accelerate the process it is necessary to use an electronic 
computer. The random process is then simulated with the aid of 
the computer, and its statistical characteristics are evaluated and 
printed out by the machine in the form of a final solution. 

In particular, the above example may be realized on a computer 
as follows. Many computers incorporate random-number gener- 
ators (cf. Chapter VI) which produce ineachcycle the value § of a 
random quantity, which is uniformly distributed in the range (0, 1). 
Instead of shooting an arrow or spinning a top, one can note the 
value = produced by the generator, and check whether the inequality 
t<p is satisfied. If it is, then we shall consider that a hit has 
been scored, It is easy to see that the probability for the inequality 
=< pto be satisfied, which is equal to the probability of scoring a 
hit, is, in fact, p. 

Let us select a series of 10 such values of &. If this series 
includes an even number of ‘“‘hits’’ then it will be regarded as a 
‘*success,’’ Suppose that among N series there are L successes, 
The quantity Z/N can then be interpreted exactly as above. 

The simulation of a single shot by this method involves two oper- 
ations, so that the imitation ofthe entire process with N= 10,000 
repeated series will involve 200,000 operations, Inthe case of the 
‘‘Strela’’?’ computer, this would require 100,000 seconds of time, 
which is already much faster than the actual experiment of 100,000 
shots, but is still much higher than the 24 operations which are 
necessary if the probability is to be computed from equation (1.1). 

For problems involving the determination of the accuracy of 
artillery fire from a large number of guns, the simulation of the 
process with a computer will occupy much less time than the 
analytical calculation of the required quantities with the aid of the 
same computer. Here, the method of statistical trials is virtually 
the only acceptable method of obtaining the final result. 

Occasionally, the analytical formulation of the problem (for 
example, the boundary value problem for the Laplace equation) 
forms the starting point. The appropriate random process (for 
example, the random walk process described in Section 5 of this 
chapter) is then found, and a study is made of this process. In 
other cases, the starting point is a random process whose 
analytical description is of little use in practice, or is not con- 
sidered at all. An illustration of this is the above calculation of 
the probability of hitting a target under given firing conditions, or 
the determination of the parameters of amass-servicing process, 

In the above example we were concerned with the precise 
simulation of a random process. In practice it is more usual to 
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compare the process under investigation witha simplified artificial 
process which canbe simulated ina computer and is in some sense 
an approximation to the original process. Thus, when the Monte 
Carlo method is used to solve the boundary value problem for the 
Laplace equation in a region G, the corresponding random process 
is the Brownian motion in the region @ with continuous time and 
precipitation on the boundary of the region. Onthe other hand, the 
situation is usually simulated by a random walk, with a discrete 
time, over a lattice inscribed in the region G, so that at each 
instant there is a ‘‘jump’’ of the Brownian particle from a site of 
the lattice to a neighboring site(cf.Section5), This simplification 
is usually dictated both by the absence of complete information 
about the real process and by computer limitations which 
prevent the simulation of a complicated process in an acceptable 
length of time. 

The Monte Carlo method has been most successful in those fields 
where the basic mathematical problem involves the investigation of 
some random process, Thus, problems in neutron physics are 
formulated in a probabilistic way, the separation of signals from a 
random-noise background is a probabilistic problem, and:so on. 
However, there are computational problems whose formulation is 
not related to the theory of probability, but which can be success- 
fully solved by the Monte Carlo method. The most typical examples 
are the boundary value problems for elliptical equations (for 
example, the Laplace equation), and the related problems for 
parabolic equations (for example, the heat transfer equation). 

These equations are closely connected with certain random 
diffusion-type processes, It follows that the solution of these 
equations can be conveniently carried out by simulating such 
processes. A method whereby this can be done is discussed in 
Section 5. It is interesting to note that when the relation between 
the boundary value problems and random processes was first 
discovered, the main interest was focused on the fact that the 
methods of the theory of differential equations could be used to 
study a wide class of random processes. In the Monte Carlo 
method, this classical situation is reversed: the simulation of 
random processes is found to be a very convenient method of 
finding the solutions of differential equations. 

A second example is the solution oflinear algebraic equations. 
Section 4 gives an example of a statistical simulation of a process 
whose characteristics are used to find the unknowns in a system of 
linear algebraic equations of the form «= Ax-+6, where Aisa 
matrix which is approximately equal to the unit matrix. Methods 
which may be used for more general systems have been discussed 
by Shreider [68]. 
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The development of methods based on probability theory for 
evaluating integrals is of particular importance. Since a prob- 
ability can always be regarded as a measure, it follows that the 
determination of the probabilities of some given events, or their 
mathematical expectations, can always be reduced to the evaluation 
of an integral. Consider the integral 


1 


fax, 


0 


where f(x) is such that 0 < f(x) <1 for 0< «<1, i.e., the values 
of f(x) lie between 0 and 1. It is required to find the area S of the 
region G bounded by the curve y= / (x), the x-axis and the ordi- 
nates x=0, x=1(Fig. 1), It should be noted that the limitation 
imposed on the function f(x) is unimportant because the scale 
can, clearly, be altered, Suppose that we choose a random point 
in the square 0< x<1, 0<y<1 by taking one whose coordinates 
are independently uniformly distributed inthe interval(0, 1). What 
is the probability p that the point will be in the region under the 
curve? Suppose that one such point is (&, 7). This point will lie in 
the square, Since 0<<£<1,0<7< 1. Itisclear that the probability 
p is equal to the area S, i.e., the required integral. 


YH ] Yyy 


Aone J 
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Fig. 1. 


Suppose we have some way of obtaining independent and 
uniformly distributed quantities = and 47. We will take the first 
pair of these quantities and check whether the condition 


f®™>2 (1.2) 


is fulfilled. If it is, then the chosen point (, 7) does lie in the 
region G under the curve, Next, let us take N pairs of random 
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quantities, and for all of them verify whether the inequality (1.2) 
is satisfied. Suppose that for ZL pairs of these N selections the 
inequality (1.2) is satisfied. The quantity L/N will then be approxi- 
mately equal to the probability that the random point lies in the 
region G, i.e., 


1 
<= p=f f(ade. 
0 


The estimated error in the integral evaluated in this way will 
again depend on the number of trials N, 

In both the examples discussed above, the simulation of the 
process involved the estimation of the probability of some event 
A, A more general case is one in which an estimate is made of the 
unknown mathematical expectation of a random quantity y. A 

1 


simple example will involve the evaluation of the integral f fi (x)dx. 


5 
Suppose = isa quantity which is uniformly distributed in the interval 
(0, 1), The quantity »n—/(§ will then have the mathematical 
expectation 


1 
a=Mn= [ f(x)dx. 
0 


Thus, in order to evaluate the integral, it is necessary to choose 
N independent values §,, f, ..., &y of the quantity & and form the 
arithmetic mean 

= FE )+F(&) + +» +F Ey) 

= a 7 
The quantity 7 will be an approximation to the integral. We note 
that an error in a few values of f(£,) will be smoothed out when NV 
is large, i.e., the method is independent of random machine faults. 

The examples considered in this section will serve as an 

illustration of the general characteristics of the Monte Carlo 
method, i.e., the necessity of carrying out extensive series of 
calculations of a given type, and the smoothing out of errors. In 
what follows, other characteristics of the method will become 
apparent, for example, the use of relatively small memory stores 
for intermediate results, and adaptability to multidimensional 
problems, All these characteristics show that the Monte Carlo 
method is very suitable for calculations on large digital computers. 
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2, ACCURACY OF THE MONTE CARLO METHOD 
AND ITS MAIN CHARACTERISTICS 


Let us now consider the accuracy of the Monte Carlo method. 
Suppose that an event A which occurs with probability p is to be 
simulated. Let & be a quantity which is equal to unity if the ith 
trial results in A, and zero if the event A does not occur. Thus, 
the total number of trials for which the event 4 does occur is 


where JA is the total number of trials, Individual trials will be 
assumed to be independent. 

The relative frequency of occurrence of the event A is equal to 
L/JN and is a random quantity whose mathematical expectation is 


M(4)=7MO=y YM Apa 
and whose dispersion is 


D(+)= DM= AS py, = Nop) _ 2p) 


i=l 


In the above expressions we have used the fact that the mathe- 
matical expectation for each of the &, is 


ME, =0(1— p)+1-p—=p, 
and the dispersion is 


DE, = Mii — (ME, = 0(1 — p)+1- p— p= p— p= p(1— p). 

According to the law of large numbers (Bernoulli’s theorem), 
the relative frequency of occurrence ofthe event A, which is equal 
to L/N,is approximately equal tothe probability p. More precisely, 
for any e > 0 and any 5 > 0 there existsa number of trials N which 
is such that with a probability greater than 1 —e, the relative 
frequency of occurrence of the event A will differ from the 
probability p, that the event A will occur, by an amount less 
than & 


|p — P| <2. (1.3) 
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Since p is the required quantity, while + is its approximate value 
obtained by the Monte Carlo method, it follows that the difference 
5 —p is the error of the Monte Carlo method. 

It is clear from the above that this error can only be estimated 
probabilistically with a degree of reliability 1— e«. Inwhat follows 
we shall usually assume that the degree of reliability is 0.99 or 
0.997. 

It should be noted that the use of computers does, in one 
sense, remove the distinction between the Monte Carlo method and 
ordinary computational methods, In fact, whenever a complicated 
computational problem is solved with the aid of a computer, it is 
always necessary to consider the possibility of random error (due 
to a fault or a rounding-off procedure) and therefore the final 
result can only be regarded as reliable with a certain probability 
approaching unity. Moreover, in complicated computational prob- 
lems, the error can only be estimated from the results of the 
calculation. 

The left-hand side of the inequality (1.3) can always be esti- 
mated with the aid of the Chebyshev inequality, 


4—ol<y 2a9. (1.4) 


where « is, as above, the probability that the estimate (1.4) does 
not hold, This estimate is derived in textbooks on the theory of 
probability. The most important fact is that the estimate is of the 


order of N72. In other words, the error 6 involved in the value 
given by the Monte Carlo method for the probability that the event 
A will occur is given by 


Bw (1.5) 


Although the estimate (1.4) may be considerably improved, the 
relation given by (1.5) will always hold, It has a very important 
bearing on the accuracy of the Monte Carlo method and the limits 
of its applicability. 

It is clear from (1.5) thatthe error 6 in the approximate solution 
of a problem by the Monte Carlo method can be reduced by 
increasing the number of trials N,i.e., by increasing the computa- 
tional time, For example, the time necessary to complete the 
solution must be increased by a factor of 100 if the accuracy is to 
be increased by one order of magnitude, It follows that the Monte 
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Carlo method cannot yield highly accurate solutions. In practical 
problems the Monte Carlo method will yield an accuracy of the 
order of 0.01-0.001 of the maximum value, unless special methods 
for accelerating the calculations are employed. 

It will be shown later that the Monte Carlo method is very 
Suitable for the solution of multidimensional problems. These 
problems do not usually require high accuracy, and hence the 
above disadvantage of the method is not as important as it might 
appear at first sight. 

Moreover, it should be noted that when the probabilities are 
computed by the Monte Carlo method (for example, the probability 
of hitting a target, the probability that a neutron will penetrate 
through the shielding of a reactor, and so on), high accuracy is 
frequently unnecessary in view of the very nature of the problem. 
Thus, it is immaterial in practice whether a target will be hit 
with a probability of 0.901 or 0.902. 

Suppose it is necessary to simulate a process for which each 
ith trial of N independent trials yields a quantity ¢, Suppose that 
this quantity has a finite mathematical expectation M:,;= a and a 
variance D§,=<«?. The arithmetic mean 


is then an approximate value of the required mathematical expec- 
tation a. The quantity — isthe result of the solution of the problem 
by the Monte Carlo method. The deviation ofthis quantity from the 
required mathematical expectation is in fact the error of the 
method, , 

As above, the estimate of the error is associated with the 
probability 1—«. This means that the estimate 


a=[E—al<oy , (1.6) 


which can be obtained in the same way as (1.4) from the Chebyshev 
inequality, holds witha probability not lessthan 1—e«. The relation 


a 
YN 


will hold as before with all the resulting consequences, 
Equation (1.6) clearly indicates the significance not only of the 
number of trials, but also ofthe variance of §,. One should always 
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bear in mind the possibility of reducing this variance, using, when 
necessary, special methods which improve the convergence. In 
fact, the number of trials N may be reduced by reducing a in the 
estimate (1.6), and this will, of course, also reduce the time 
necessary for the solution of the problem. Let us now improve 
the estimate (1.6), and as a special case derive (1.4), To do this, 
we note that the quantity L—=!,+§--+ ... +, isthe sum of a large 
number of random quantities. This is also true of the quantity 


t—=ytyt eee 40%, (1.7) 


It follows that the distribution of = maybe obtained from the limit 
theorems of the theory of probability. Let r, be a quantity which 
is such that with a probability « we have 


jE—al<r,. 
The quantity 7, may serve as an estimate of the error, which in 
general is better than (1.6), _ 

The most important case is that in whichthe quantity — is dis- 
tributed in accordance with the normal law (Gaussian distribution), 
The fact that £€ is distributed very nearly in this way can be 
established from some very general properties of the quantities & . 
This fact is ensured, for example, if the quantity 


6 
PYN (1.8) 


is sufficiently small, where } is the so-calledthird moment of £;. 
Since the values of N which are commonly employed in the Monte 
Carlo method are of the order of 103-~10° , it follows that the 
Gaussian distribution will as a rule be obeyed. The only exception 
will be the simulation of events occurring with a low probability 
p, in which case the Poisson distribution will hold for integral 
values of L. The condition that the limit law for L is the Poisson 
distribution is 


1 
p~. (1.9) 


In other words, it is necessary that the quantity pN should not be 
too large. If this condition is satisfied, the quantity given by (1.8) 
will assume the form 


—— = = 


which, in view of (1.9), isnotofthe order of zero, i.e., contradicts 
the normality condition. 
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Consider now the usual case, in which £ is distributed very 
nearly in accordance with the Gaussian law. When the error is 
known with probability «0.997 we have r,= 23,, where % is the 
standard deviation of §— (this is the so-called three-sigma rule). 

The variance oj of the arithmetic mean { is related to the 
variance o? of =, by the formula 


aaa ie (1.10) 


and hence the error of the Monte Carlo method may be estimated 
from 


i= [i—al<a- (1.11) 


This estimate is exact. If the quantity on the right-hand side of 
(1.11) is reduced, then the corresponding probability is reduced. 
The expression (1.11) shows that the accuracy of the Monte Carlo 
method depends only on the number of independent trials and the 
variance. In particular, the number of operations involved ina 
transition from one-dimensional integrals to multidimensional 
integrals will increase only as a result of the increase in the 
complexity of the integrand and in the number of uniformly dis- 
tributed quantities employed, i.e., in practice it will be directly 
proportional to the dimensionality. At the same time, the number 
of operations in the case of ordinary quadratures increases as 
K" where n is the multiplicity of the integral. This shows that the 
Monte Carlo method is very suitable for multidimensional prob- 
lems. This will be seen from examples which will be considered 
later. 

Equation(1.11) can also be interpreted in another way. Let 6 be 
the required accuracy ofthe calculations, and y the average number 
of computer operations per trial. The total number of operations 
F will then be given by 


Qvo2 


3 (1.12) 


Fo Nr 
and the total time for the solution of the problem by 


Te Nee OR, (1.13) 


where ¢ is the computer time per operation. It is clear from this 
formula that the time T must be increased by two orders of 
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magnitude if the accuracy is to be improved by one order of 
magnitude, 

It is usually relatively difficult to estimate the variance oc? in 
advance (i.e., prior to the solution of the problem), Theoretical 
estimates are, as a rule, much too high. In solving specific 
problems, the error may be estimated by substituting the statistical 
estimate of the variance 


aa & —tP+ (E+ oo. +Ey—ty 

eos = aod 
into the right-hand side of (1.11) instead of the theoretical variance 
oe, The statistical estimate is obtained as a result of the simulation 
of the values of the random quantity. 

Thus, the accuracy which may be achieved with the Monte 
Carlo method can only be properly estimated in the process of 
solution. This fact is analogous to the well-known result that the 
accuracy of a physical experiment canbe reliably determined only 
from the results of the experiment itself, 

The estimates given above were concerned with absolute 
errors, Letus return now tothe simulation of an event 4 occurring 
with probability », and estimate the relative error in the deter- 
mination of p with the aid of the frequency L/N. In this case the 
estimate (1.11) will be of the form 


_|t p(l—p) 
i w—P|<3V pee), 
and hence the estimate for the relative error is 


3 i— 

> <3V Sx: (1.14) 
Thus, the required number of trials can be determined in terms 
of the admissible relative error d from the formula 


901— 
NwoS), (1.15) 


which shows that the number of trials is inversely proportional 
to the required probability p. 

This means that very small probabilities cannot be determined 
by the Monte Carlo method. When the magnitude of N as given by 
(1.15), or, more precisely, the time T= N»z for the solution of the 
problem, is inadmissibly large, then it is necessary to try to 
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transform the problem into another with reasonable values of the 
probability P. 

We note that in deriving (1.14), use was made of an estimate 
based on the assumption that the distribution law is normal, 
although we were concerned with a low probability ». The point 
is that when pN~1, it is impossible in principle to obtain a low 
relative error, and it is therefore necessary to demand that the 
condition pN<=>1 should be satisfied. This in itself leads toa 
distribution law for the frequency M/N which approaches the 
normal distribution. 


3. GENERATION OF RANDOM NUMBERS 


In order to be able to solve practical problems by the Monte 
Carlo method, it is necessary to have a random-number generator 
with a sufficiently varied store of distribution laws. 

In this section we shall consider various methods of generating 
random events and random numbers. These problems will be 
considered in greater detail in Chapters VI and VII. 

It turns out that uniformly distributed random quantities are 
the most important. They may be used to simulate random events 
and random quantities obeying various distribution laws. 

Suppose we have a random quantity § which is uniformly dis- 
tributed in the interval(0, 1). Let us consider the problem as to 
how this random quantity may be usedto simulate an event A which 
occurs with probability ». Let us define A as the event that ‘‘the 
chosen = satisfies the inequality :< p.’’ It is easy to see that the 
probability that the event A will occur is 


Pp 
fax=p, 
0 


and hence the event A will occur with the given probability p. 

The uniformly distributed random quantities : may be used to 
obtain other quantities which obey practically any given distribution 
law F(x), All that is necessary is toreplace : by some monotonic 
function of :, for example, y= /(§). The quantity 7 will follow the 
distribution law 


Faas) 
F(x)= f h(t) dt, (1.16) 


—-o 


where f '(x) is the function which is inverse of f(x) and A(x) equals 
unity for0 < x < 1, and zero outside the interval (0, 1). 


ee 
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In order to obtain the exponential distribution law for which 
for x<0, 
Py =| 1—e-* for x>0, 
it is sufficient to take 


f(s) =—+In(1—»). (1.17) 


In, fact, the function which is inverse to f(x) will be of the form 
ta Vaxj=1 — exp(— o.), and in accordance with (1.16) 


l1-e7~ 


F(x)= f moat =| 


oO 


0 for x<0, 
—e-* for x >0, 


Since for x >0 the upper limit lies inside the interval (0, 1), the 
quantity 7 will in this case be distributed in accordance with the 
exponential law. 

In order to obtain quantities distributed in accordance with the 
Gaussian law, one can use yet another construction, According to 
Lyapunov’s theorem, the quantity 


pay Pla tet a +43] 


will approximately follow the normalized Gaussian law for 
sufficiently large n, where &,, §, ..., &, are independent, uniformly 
distributed quantities. In practice it is quite sufficient to take 
n=5. The values of the corresponding empirical distribution 
obtained by this device with the aid of the ‘‘Ural’? computer are 
given in Table 1. In this calculation 1,000 values of § were taken 
to compute the empirical distribution law. 

Thus, the main problem consists in finding a method of obtaining 
uniformly distributed random quantities, 

Consider the discrete random quantity x, which can assume 
the values | or 0, each with probability 0.5. For example, we 
could toss a coin and define x,—1 as corresponding to ‘‘heads’’ 
and x,=0 as corresponding to ‘‘tails,.’’ 

Consider now the infinite sequence +,,.x,,..., x; ...and look 
upon this series of zeros and units as the binary digits of some 
number = given by 


e= x27". 


The number ¢ isa random number lying in the range 0<§<1. The 
probability that — will lie in the interval (0, 0.5) is equal to 0.5, 
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Table 1 


0.001 
0.005 
0. 006 
0.010 
0. 002 
0.000 
0. 000 
0. 006 
0.014 
0. 003 
—0. 007 
—0. 007 
—0. 002 
—0. 007 
—0. 002 
0.005 
0.005 
0. 001 
0.001 


—(, 


3 
0 
3 
6 
9 
2 
5 
8 
1 
4 
7 


NNNRFRFRrROOCS 


. 


the probability that it will lie in the interval (0, 0.25) is equal to 
0.25, and so on. In general, the probability that § will lie in any 


interval of the form (a =) is equal to the length of the 


interval a It follows that — is a uniformly distributed random 


quantity. Hence, we have a method of simulating a uniformly dis- 
tributed random quantity. The method requires the availability 
of an infinite sequence of independent random quantities |x,;} which 
are looked upon as the binary digits of a number &.. 

In practice, it will be necessary to cut off this sequence at a 
finite value of n and obtain a quantity t* whose distribution has the 
form of a step-function F,(x). This function will approximate the 
uniform distribution law given by the function F (x) which is defined 
by 

0 for x<0, 
F(x)=j x for 0O<*x*<l. 
] for x1, 


16 I, FUNDAMENTALS OF THE MONTE CARLO METHOD 


The deviation can be estimated from the inequality 
1 


There are two main types of physical generators for the random 
numbers x;, The first type makes use of the emission from radio- 
active matter, and the second is based on the intrinsic noise of 
electronic tubes. 

Consider the first type and suppose that we have a radioactive 
source, A counter is used to count the number of radioactive 
particles which are emitted in a time Af. If the number of these 
particles is odd then x, istakentobe unity. If the number is even, 
then x; is taken to be zero. Itis usually considered that the prob- 
ability w, that the counter willrecord k particlesin the time Af is 
given by the Poisson distribution 


GAM paras, 


toes 
The probability that an even number of particles will be 
recorded is then given by 


2k 
n=), = >) (2. At) en hat, (1.18) 


k=0 k=0 
Substituting z—)At we have 


o 


gzk 
Po= 44 Gy! 


e~* =e” cosh z= 


Hence, it is clear that the probability of recording an even number 
of particles in the time 4¢ is given by 


1 ea 2hat 
Y= ito . 


When } Af is sufficiently large, p) will tend to 0.5. 

The quantity Af is equal to the mathematical expectation of the 
number of particles recorded in A‘. Hence, it is clear that the 
time Ar necessary to obtain a single binary digit must be such that 
the average number of particles recorded in At is sufficiently large. 


When exp(— 2h A¢ )= 0.01, then Mt~ a In 100. Hence, 4At~ Into, 


which corresponds to roughly three particles. 

When it is required to obtain m binary digits, then the time will 
be greater by a factor of m, Assuming that with the accuracy 
usually required the quantity & is taken to 15 binary digits, it 
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follows that the time necessary to obtain a single uniformly dis- 
tributed number is such that the total number of particles 
recorded in that time by the counter is 40 to 50. When it is 
necessary to obtain normally distributed numbers, then this time 
will increase accordingly. This constitutes a limitation of the 
applicability of this method of generating random numbers, 

Consider now the second method. Electronic tubes are always 
found to exhibit intrinsic noise which, with suitable amplification, 
will give rise to appreciable fluctuations in the output voltage, 
There are no fundamental difficulties in constructing an electronic 
circuit whose output voltage U (¢) will be a random quantity. The 
values of this quantity may be chosen at widely spaced times 
ths tor wees tgs ++» , SO that U(t,), U(t), .... U(t,) can be regardedas 
independent. The quantity x, can now be defined by 


_ {0 for Ut)<a, 
s—{ 4 for U(t,) >a. 


The quantity @ is usually referred to as the cut-off level. It 
should be chosen so that the probability ofx,=1 is 0.5. In fact, the 
main difficulty lies in the correct choice of a, Various methods 
are available for this purpose. The simplest method is to take a 
pair of values x, and x,,, andconstruct a quantity y, in accordance 
with the rule 


(1.19) 


=|,’ if x,=1, *;4,;=0, 
WO, if pi eI. 


If, on the other hand, «;—=.~;,,, then y, is determined by taking the 
first of the subsequent pairs x;1,, *;1,,, Whose members have dif- 
ferent values. If the probability of x,= 1is denoted by w then the 
probability that y,=1is 


w(l—w) 1 


w(il—w)+(i—w)w 2° 


It is easy to see that the searchfor a suitable pair, x,,, x,.,,, Will 
be completed in a finite number of steps, and will occupy, on the 
average, 


1 
2w (l— w) 


steps. When w= 0.5, this will amount to 2 steps. 
The second method is particularly convenient when w is close 
to 0.5. Let w= 0.5-+e and let us define y, by 
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{ 1, if x,# x,y 
A= , 
0, if x,=x,_). 


In this case, the probability that y, = 1 is 
ow(1 —w)=2(5+ °)(5— e) = 2(7—#)= 5-28, 


i.e., it is much closer to 0.5 than is w, ife< 0.5. This method 
may be iterated by taking 2” consecutive values of x,, When a 
trigger circuit is used as the source of noise, the above method 
will require the alternate selection of the output signal from dif- 
ferent outputs of the trigger, depending on the value of the 
preceding output signal. 

If a particular computer does not incorporate a random-number 
generator, then use is made ofthe so-called quasi-random number 


sequences. A quasi-random sequence £,, §,..., &,... iS one de- 
fined by a recurrence relation 
ae ae ee eS (1.20) 


which is such that when i1<n<WN the sequence it generates has 
the statistical properties of a sequence of independently chosen 
values of a uniformly distributed random quantity (the distribution 
need not be uniform, although it must be known). 

An example of the generation of quasi-random sequences of 
the first order (k=1) which was analyzed with a computer is 
given below. The rule consists of the following. Lett,_, be an 
mth order binary number of the form 


En = 8,27" oe ae Se oe a 
The square of this number is then of the form 
B21 = 812 4-827. Bog 


Let us select the central portion of this number (assuming m 
is even) and replace (1.20) by 


_ -1 -2 -m 
f= SG =Fm, 2 +3n,? -- eas +8an2 . 


This method may be used to obtain satisfactory sequences for N 
of the order of a few thousand. Satisfactory quasi-random se- 
quences of considerable length may be obtained by using higher 
order sequences (k=2 or 3), There are also other methods of 
constructing quasi~random sequences which are used with elec- 
tronic computers. Some of these methods are based on operations 
peculiar to the particular computer (cf. Chapter VI). 
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4, SOLUTION OF SYSTEMS OF LINEAR 
ALGEBRAIC EQUATIONS 


The solution of systems of linear algebraic equations is of 
great importance in many fields in which computational mathe- 
matics is employed. This is one of the reasons for considering a 
further class of methods for the solution oflinear systems, which 
are based on the simulation of random processes. One of such 
methods is described below. Other methods for the solution of 
systems of linear algebraic equations are described elsewhere 11). 

Consider a system of linear equations which may be written in 
the vector form 


Ax =D, (1.21) 


where A is an n-dimensional matrix and *and 6 are n-dimensional 
vectors, Suppose that we are dealing witha case where the method 
of simple iteration is possible, i.e., the matrix A can be written in 
the form A—E—8,where EF is the unit matrix and B has charac- 
teristic numbers whose moduli are less than unity. The system 
given by (1.21) is then equivalent to the following ‘‘iterative’’ form: 


x=Bx-+b. (1.22) 
The solution of (1.21) may be written in the form 
x= A'b. (1.23) 


Subject to the above assumptions, the reciprocal matrix may be 
expanded into the series 


AS84648. EB (1.24) 


This series will converge when, and only when, the character- 
istic numbers of the matrix B are such that 


|); (B)| <1.* 


Let us now substitute (1.24) into (1.23) and write the solution in 
the form 


*=6+ Bb+ B+ ...+B"b+... (1.25) 


The partial sums of the latter series may be obtained by means 
of the usual iteration method by successively writing 


*This is also the condition for convergence of the method of simple iteration, 
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x0 =6+6 
anil oni ten et 


abe Db Br- 1o+ Bet... +8, 


The convergence of the method of simple iteration is equivalent to 
the convergence of the sequence +!, +x), ..., x4), ... 

In order to evaluate x let us rewrite these expressions in terms 
of their coordinates. Let the elements of the matrix B be denoted 
by B, ; 80 that the mth coordinate +, of the vector x is 

Xin =b at DL Bmibi + Pt Bri Bi i, =P 4 


2 2 


oe =. : Bmi Bit, --+ Bi,_ 1 i 


ror fr” 
r 


(1.26) 


This expression will be used later. The problem consists in 
finding a method for evaluating this sum, 

To begin with, let us consider the case where the elements of 
the matrix are positive and the sum of the elements in each 
column is equal to unity,* i.e., 

SB 


ye mai. 
: 


The quantities B,, can then be regardedas the probabilities for an 
exhaustive set of incompatible events, 

In the preceding section, we were concerned with methods of 
simulating random events. In the present case, it is necessary to 
simulate a system of random events. 


Consider n independent values &,, &,... & which are uniformly 
distributed in the interval(0, 1). Letus now subdivide this interval 
into n lengths B,,,.Bjo +++ Ban (m@=1, 2,... ) If &, lies inthe ‘th 


section (length 8,,,) then we shall put y,, =, This defines n random 
quantities y,, y., .... 3, The mathematical expectation of y,, is 


n 
MY n= pa Bribp 


i.e., it is equal to the second term in the series (1.26). 

We shall attempt to obtain a random number whose mathe- 
matical expectation equals the third term of the series (1.26), 

To do this, consider again the quantities §,. If &,, lies in the 
i,th section, then we will try another value, If¢, lies in thes th 


*We are considering this case only toelucidate the probability model, In reality, the 
series given by (1,25) will diverge under these conditions, since the resulting matrix 
necessarily has a characteristic value } = 1. 
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section, then we shall put Z,=0;,. This defines the random 


quantities Z,,. The probability with which the random quantity 
Zm assumes the values 4, is equal to 


x Bmi,Bi,iy 

a 

and the mathematical expectation of Z, is given by 
MZ,,= 2 BniBisii,» 


The above construction makes possible a simple generalization 
whereby it is possible to obtain a random quantity 7, whose 
mathematical expectation is equal to the sum of the series 
(1.26), i.e., 


Man = Xm 
Thus, by simulating N times the process of determination of 
Mm and as a result obtaining the sequence 7}, 77, ..., 7X , one can 


evaluate the arithmetic mean 


- — tatint-- th 
=o 


Tn = 


which may be taken as an approximation of the required quantity 
x,. The error may be estimated by the method described in 
Section 2. 

The random quantity »,, may be constructed as follows, Each 
element B,, of the matrix B may be written in the form ofa 
product of two factors 

Bing = FP npPmp (1.27) 
where 0<P,,;< 1.* The components of the vectors on the right- 
hand side of (1.21) will be written in the form 5,—/,p, where 


0<p<1.* 
It will be assumed that 


Pmt 24 Pm = t. (1.28) 


“It will be seen later than when B,,; = 0 it is convenient to let Pp,; = 0. 
**When 6; = 0 it is convenient to let pj; = 0, 
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Equation (1.26) can then be written in the form 


Xm = SmPmt QF nif Pm P+ ms 
vot 2 Paty tty =e Pi aat,TiPmt, oF, (1.29) 


bys dgreos Ey 
a ee 
= TE a : 


Consider now ” subdivisions of the interval [0, 1], each into 
n-+1 segments. The lengths of the segments obtained as a result 
of the mth subdivision will be 

Pins Pra ones Pron: Pant 

Let &, &,.--» & .«..be values of independent, uniformly dis- 
tributed quantities. The quantity 7, will be definedas follows. 

To begin with, consider the mth subdivision of the interval 
(0, 1] and determine which segment correspondsto &. If % corre- 
sponds to the (n-++ 1)th segment (length p,,) then we assume that 


Am = Sm (1.30) 


If §& corresponds to the /,th part (length Pmi,) then the i,th sub- 
division is selected and the segment of this subdivision which 
corresponds to §, is noted. Again, if {corresponds to the (n+ 1)th 
segment of the subdivision, then we assume that 


Ye mi Sin (1.31) 


If, on the other hand, §, corresponds to the /,th segment of the 
subdivision, then & is selected and the segment of the é,th sub- 
division to which it belongs is noted. This process is continued 
either until one of the quantities & falls on the (n-+1)th segment 
of the /th subdivision, or indefinitely. 

It may, however, be shown that, with a probability equal to 
unity, the second case (infinite continuation) is never encountered. 
This follows from the general properties of Markov chains which 
are discussed in Section 6 of this chapter. 

The quantity »,, will thus assume values which are determined 
by the history of the process. In particular, it assumes the value 


miy tty * Fig tj (1.32) 
with probability 
Pri Pit, eae Pi oP i. (1.33) 


Thus 7, assumes the values given by (1.32) if & lies in the ‘th 
segment of the mth subdivision, ¢, in the i,th segment of the 4th 


t 
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subdivision, and so on, while =, lies in the(n- 1)th segment of 
the &th subdivision. 
The mathematical expectation of 1, is 


M1n=SmPmt » FrisF its eee Pry ti, . 
k 


Ral fy ly wend 


(1.34) 
> Pm, Piris eee Pigs Pigs 


This follows from (1,32) and (1.33), 
Comparing (1.34) with the expression for the mthcomponent of 
the unknown vector «, which is given by (1.29), we have 


Mn == ¥ me (1.35) 


This identity may be used to calculate x,, by a Monte Carlo simula- 
tion of the random quantity 7,, . 

We have thus arrived at a Monte Carlo process for the solution 
of systems of linear algebraic equations. There are also other 
processes of this kind where a statistical model is used for the 
determination of the reciprocal matrix, or for the solution of 
(1.21) with an arbitrary matrix A. 

The particular advantage of the above method of solution lies 
in the following. When systems of linear equations are solved by 
the usual method, the determination of any one of the unknowns 
requires a knowledge of all the remaining unknowns. In the above 
method, on the other hand, this is not necessary; each time, a 
single coordinate x,, is determined. This is shown by the fact that 
the number of arithmetical operations is proportional to the 
number of equations and not to the cube of this number as in the 
standard numerical methods. This is one of the fundamental 
features of the Monte Carlo method, namely, its suitability for 
and convenience in the case of multidimensional problems. 


5. RANDOM WALK AND THE SOLUTION 
OF BOUNDARY VALUE PROBLEMS 


Boundary value problems and problems with initial conditions 
for linear differential equations are among the more interesting 
fields where the Monte Carlo method is applicable. The connection 
between the solutions of such problems for certain classes of 
equations, and stochastic processes of the random walk type, has 
been known for some considerable time [232]. However, the 
application of this fact tothe practical solution of equations became 
possible only after the development of computers. 
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We shall see later that the Monte Carlo method is particularly 
effective in the case of multidimensional problems, since the 
time necessary for the determination of the value of the solution 
at a given point depends only on the diameter of the region. 

In order to illustrate the basic idea of the method we shall 
consider the Dirichlet problem for the Laplace equation. Consider 
a function f(Q) which is defined on the boundary of a simply con- 
nected plane region G, It is required to find a function u(P) which 
satisfies the Laplace equation Au —0 within the givenregion G and 
assumes given values u|r=/(Q) on the boundary I of the region. 
This problem is usually reduced to a finite difference scheme. 

Consider a planar square lattice with a spacing equal to#. We 
consider only those nodes of the lattice which lie inside the region 
G, They can be divided into two groups, namely, those which have 
four neighbors inside G and which are known as internal nodes, 
and those which have fewer than four neighbors inside G and which 
are referred to as boundary nodes (Fig. 2), 


,Sascesexe 
ACCC, 


OBoundary nodes @Internal nodes 


Fig. 2. 


The function « assumes the given values u(Q)= /(Q) at the 
boundary nodes; the values of w on the boundary are attained at 
the boundary sites by means of special rules, 

The values of the function u(P) at the internal nodes will be 
sought using expressions of the form 


a(P)=4[aP)+4(P,)+a(P)+4(PyL (1.36) 


where P,, P,, P, and P, represent the four sites which are the 
nearest neighbors of P and lie either inside the region or on the 
boundary. Equation (1.36) is the usual finite difference system, 
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Consider the probabilistic scheme related to (1,36). It is 
usually referred to as the random walk problem, We shall regard 
the squares of the lattice as blocks in a city and the nodes as 
crossroads. Suppose that a drunken man leaves a node P and can 
reach any of the neighboring nodes with equal probability (0.25). 
Similarly, having reached the next site (crossroads) he can, with 
equal probability, reach any of the four possible new neighbors. 
We shall assume that the town is surrounded by a very deep 
moat, which means that when the man reachesthe city boundaries 
(i.e., a boundary node of the lattice), he will remain at this site by 
falling into the moat. The problem is todetermine the probability 
that such a man leaving a site P will complete his random walk at 
the boundary node Q. It may be shown that, with probability equal 
to unity, he will eventually reach the city boundaries. The required 
probability cannot easily be obtained in an explicit form. How- 
ever, it is quite simple to derive a relation involving the prob- 
ability u(P, Q). We note that the walks which, starting from P, 
terminate at the point Q are composed of four mutually exclusive 
classes, namely the walks in which the man reaches Q from P via 
P,. P,, Ps and P, where P,, P,, P;, P, denote the four neighbors of 
P, Since the probabilities of reaching P; from P are all equal to 
0,25, the law of combination of probabilities yields 


4 
u(P, Q=+ Ya, . (1.37) 
i=l 


We havethus arrived at the finite difference equation (1.386). More- 
over, the probability u(P, Q) satisfies the boundary conditions 


4(Q, Q=1, 
4(Q’, Q=0 (Q'#Q), 


where Q and Q’ are boundary nodes. 

It is known that there exists a unique function which will satisfy 
(1.36) for the given boundary conditions. 

By simulating N times the random walk of the man, one can 
determine the number oftimes, Z, which will result in his reaching 
Q from P, and this will give an approximate estimate for u(P, Q), 
namely 


(1.38) 


£mu(P, Q). 


We thus have an approximate solution of (1.37) subject to the 
boundary conditions (1.38), 
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The probability scheme must be generalized somewhat when it 
is required to solve the Dirichlet problem with general boundary 
conditions. 

Let us assume that when the man falls into the moat ata 
boundary node Q, he must pay a fine* f(Q). It is clear that the 
magnitude of the fine §(P) paid by the man starting from the point 
P is a random quantity. 

It may assume the values 


F (Qq)s fF (Qa)e oer F(Q.) 


where {Q;. Q, .... Q;} is the set of all boundary sites, The prob- 
ability of paying a fine f(Q,)isequalto a(P, Q,), so that the mathe- 
matical expectation of the fine is 


w(P) = ME(P) = BF Quel, Q). (1.39) 


It is clear that the magnitude of w(P) depends on the starting 
point P, The function w(P) satisfies the difference equation 


4 
w(P)=+ >) w(P). (1.40) 


f=1 


In fact, substituting Q =Q, into (1.37) and multiplying both sides by 
f(Q;) we obtain equation (1.40) after summation over all the 
boundary sites Q,. 

The function w(P) satisfies the required boundary conditions at 
the boundary nodes. In fact, substituting P—Q in (1.39) we find 
that, in view of (1.38), all the components on the right-hand side 
of (1.39) will vanish except for 


w(Q)=4(Q, Q) fF (Q)= Ff (Q). 


Thus, the function w(P) assumes the prescribed values on the 
boundary, i.e., it is a solution of the Dirichlet problem. 

Consider now the time necessary for the solution of the prob- 
lem, Suppose that the coordinates of the site P are(x , y)). The 
coordinates of the four neighboring sites are then 


P,=(x¢+ 1, Yo) 
Py =(Xp Yot!), 
P,==(Xp— 1, Yo) 
P,=(Xq Yo— 1). 


*The fine depends on the part of the boundary in which he is found, 
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Let the current coordinates of the man’spositionbe (x. y),so that 
initially x—x,, yy). When he reaches a neighboring sitefrom P, 
unity is added to or subtracted from one of the coordinates (x or y), 
This is continued until he reaches the city boundaries. The 
process involves the recording ofthe coordinates (x). y,)and (x, y). 

We shall now calculate the number of operations required for 
the completion of the random walk at the boundary. Suppose that 
the number of nodes which the man passes through in the ‘th 
random walk is v,. The time for the solution, 7, is then given by 


T=t(ytyt ... +Y%y) (1.41) 
where # is the time spent between successive nodes and N is the 
total number of simulated random walks which are necessary for 
a solution of the required accuracy. The quantity N can be deter- 
mined from the law of large numbers in the usual way, and is 
given by 


N= wee) Ps ser ; (1.42) 


where e is the error of the solution, D&(p)is the variance of the 
quantity §(p) and max| f(Q)|istaken over the boundary nodes. This 
inequality may be obtained from a simple estimate of the variance 
of &(p). 

In fact, 


DE ( 7) = M2(p) — [ME (p)]? < M2(p) salem 2, 


since all values of |§(p)| do not exceed max| f(Q)]. 

The number of nodes passed through by the man in a random 
walk is also a random quantity. It follows that the sum of the 
quantities v, is approximately equal to the mathematical expecta- 
tion of vy multiplied by N, sothatthe time necessary for the solution 
of the problem is T=:‘N(Ev). The mean number of steps in the 
random walk, Ey, depends on the form of the lattice and the 
spacing A. 

It turns out that the number of steps depends only on the linear 
dimensions of the lattice, provided it is assumed (as above) that 
h=1, If r is the radius of the region G, then Ev~,r?.* It is 
significant that this fact holds for a lattice of any number of 
dimensions, i.e., for any number of independent variables on 
which the required function u(p) depends. In general, the time 


*This is shown by Buslenko and Shreider [11]. 
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necessary for the solution of the problem is Tw . The accuracy 


of the solution will in addition depend on the coarseness of the 
lattice, For example, if it is desired to obtain the solution to an 
accuracy «=0.01 of the maximum value of f(Q), then r~100, 
N~10' and T~ 10° t. 

If it is assumed that the calculations are carried out with an 
electronic computer for which the time corresponding to a single 
step is of the order of 100 psec, then the total time for the solution 
turns out to be about three hours. 

Another important factor is that the solution will involve the 
storage of a small number of intermediate results, i.e., only the 
quantities x, y, x9. yg will have tobe stored. Specialized electronic 
computers with small storage capacity and simple construction, 
which are designed for the solution of boundary value problems 
by the Monte Carlo method, are based on this principle (1 At. 

The time required to evaluate w(P) at one site is ae 
independent of the magnitude of the grid. If it is necessary to 
evaluate w(P) at all the sites, the time of solution of course 
increases, However, it is often not necessary to evaluate w(P) at 
all the sites, but only at certain critical ones. 

This problem also illustrates the basic feature of the Monte 
Carlo method, namely, its suitability for multidimensional prob~ 
lems. 

The generalized random walk problem is related to the solution 
of the general linear eae spe equation 


a(x, y) oe + 2b (x, y) iray aaa + (x, dap 
a(x, pH tex, N+ P(e umd. 


Here, the probability that the man will reach a given site from 
another depends on the site at which he is found at a particular 
time. 

It is also possible to consider more general boundary condi- 
tions. Thus, a condition of the form 


oe |p = ku lp + f (Q) 


will be obtained if the man can, with a certain probability, leave 
the moat surrounding the city and then continue the random walk.* 


*The random walk process has a special form when se |p=Q. This is the so-called 


Neumann problem, and is discussed in Section 6, 


5. RANDOM WALK AND SOLUTION OF BOUNDARY VALUE PROBLEMS 29 


As anexample ofa nonstationary problem, consider the equation 
of thermal conductivity 


=u, (1.43) 


whose solution is a function « which depends on the spatial 
coordinates and on time. Let us assume that a lattice having a 
spacing f# is inscribed in the region D in which it is required to 
find the solution z(P, 4). This function should satisfy the boundary 
condition 


“|p=f(@Q) (1.44) 
and the initial condition 
U|,29 = € ). (1.45) 
Consider the sequence 
t=0, 1, 2. ,R, 


By choosing a suitable relation between the time scale and the 
lattice constant one can obtain the equation 


up (P) =F legs (Py) tg_1 (Pa)+-tg_1 (Ps) + 2p_-1 (Pa (1.46) 


where P,, P,, P;, P,are the four nodes neighboring P, The Laplace 
equation is obtained by removing the subscript e from (1.46). 

Consider now the following random process whose purpose is 
to find the value of u,(P) at the point P at time zg. 

The lattice used in the preceding problem can again be 
employed, except that the man will now take exactly one unit of 
time to cover the distance between neighboring sites. 

We shall assume that he starts from a crossroads P, reaches 
one of the neighboring crossroads with probability 0.25 and con- 
tinues in this way until he reaches the boundary and remains there. 
The entire process is allowed to continue for not more than k& steps. 
If after k steps the man does notfall into the moat, but reaches an 
internal site P, then he is requiredto pay a fine t= g(P). If, on the 
other hand, he does fall into the moat within the time, then he is 
required to pay a fine t= /(Q), where Q is the boundary point at 
which he falis into the moat. 

Altogether there will be N such random walks, and the total 
fine divided by Nis the approximate solution of the finite difference 
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problem forthe thermal conductivity equation (1.46) satisfying the 
conditions (1,44) and (1.45). Inorderto show this we shall calculate 
the mathematical expectation of the fine. 

Let v, (P, Q) be the probability that a man leaving P will 
reach a general point @ after & steps, It is easy to show that 
this probability will satisfy the following condition: 


O4(P. Q)= [Op-1 (Pr D+ 0-1 Pps Q + 


(1.47) 
Vp—1 (Py, D+ Vy-1 Py» Q)I- 
The quantity v,(P, Q) satisfies the boundary conditions 
Up (Q, Q)= 1, 
1,48 
0, (2. Q)=0 er 


(when Q + Q’, where Q’ and Q are boundary nodes), and the initial 
conditions 


Up (P, P)= 1, Uy(P, Q)=0, 


where P and Q are two distinct nodes (the node P is an internal 
node, while Q is a general node), 

Consider the mathematical expectation of the fine paid by the 
man starting from the point P. The magnitude of the fine assumes 
the values g(P;), g(P,); «++» &(Ps)> f(Qys f(Qy)s--+» f(Q,). It follows 
that the mathematical expectation of the fine is 


w, (PI=EVP)= Do, (P. Pig Pre 1% (P,Q)fQ). (1.49) 


Letus substitute Q—Q, into (1.47) and then multiply both sides by 
f(Q,). Next, substitute Q=P, into (1.47) and multiply by g(P)). 
Addition of all the products yields 


w,(P)= > Dy Wy (P)s (1.50) 


ial 


i.e., the mathematical expectation of the fine isthe solution of the 
finite difference equation of thermal conductivity (1.46), Boundary 
and initial conditions for w,(P) can easily be verified with the aid 
of (1.48). In fact, let P—Q,, where Q, is a boundary node, The 
right-hand side of (1.49) will then include the single nonzero term 


bed (Q) =U, (Q;Q;) f (Q;) = f (Q)). (1.51) 
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WP) = (PP; Pg (P) =8 (P)- (1.52) 


Since the solution of (1.50) which satisfies (1.51) and (1.52) is 
unique, it follows that the above method may be used to obtain it. 

A further important feature of the above method is the fact 
that it may be used to solve the problem by determining the values 
of w at any one given point, whereas in the solution of equation 
(1.36) by the usual iteration methodthe values of w must be found 
at all the lattice sites. It turns out that the time necessary for the 
determination ofa single value of w(P) isindependent of the number 
of independent variables in the problem, 


6. THE MONTE CARLO METHOD AND THE SIMULATION 
OF MARKOV PROCESSES IN A COMPUTER 


The general mathematical setup of the Monte Carlo method 
can be described in terms of the so-called Markov processes, 
Only discrete Markov processes with a finite set of states (see 
[52]), called Markov chains, are considered below. 

A system S, possessing a finite set of states M {s,, s,,.... 5}; 
is called a Markov chain. At any discrete time instant —0, 1, 
2,..., 2 the system S is in one definite state s,. 

The state s, is associated witha set of conditional probabilities 
Pi Pa» +++» Py. The quantity p,,; is the probability thatthe system, 
being at the nth time instant in the state s,, passes into the state 
s; at the(n-+-1)th instant. In other words, p,, is the probability of 
the transition s,;>s,. An essential feature of the situation is that 
the probability of transition depends only on the initial state s,, and 
is independent of the past history of the system. This constitutes 
the ‘‘Markovian’’ property of the process. 

The totality of all conditional probabilities p,, forms a matrix 
P=(p,,) which completely defines the properties of the given sys- 
tem. The state s,; is called absorbing if the probability that the 
system S remains in this state, having once passed intoit,is one. 

In terms of conditional probabilities this means that 

i if t= 7; 
ry={ 0, if ies. 

The state s; is said to be connected with the state s, if the 
probability that the system S, having passed into the state s,, 
passes into the state s; in some finite number of time intervals 
k is different from zero. 

A Markov chain is said tobeterminatingif all of its states are 
connected with some absorbing state. The following theorem may 
serve as a justification for such a term, 
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Theorem, If a Markov chainis terminating, then with probabil- 
ity unity the system S passes into one of the absorbing states in 
a finite number of time intervals. 

Proof, We shall denote by gq, (¢) the probability that in time ¢ the 
system passes from the state s, into one of the absorbing states. It 
is clear that g;(¢) can only increase with an increase of f. 

Then in accordance with the condition there exists a value 
¢=¢, such that for all i we have q,(t))>0. That is to say, every 
state of the system, with nonzero probability, can pass into an 
absorbing state in time f. 

We shall write 


I= = q; (f)- (1.53) 


It is clear that g>0. The probability, that in time ¢, the system 
does not pass into an absorbing state, does not exceed the 
quantity 1 — q. 

By virtue of the fact that the probability of transition does not 
depend on the past history, the probability of the system S not 
passing into an absorbing state in time vt, (an integral multiple 
of t)) does not exceed 

(1—g)’. 
When v—co this quantity tends to zero. 

This means that with probability unity the system S passes 
into an absorbing state in a finite time. 

Note. We shall denote by < the lifetime of the system, that is 
to say, the time it takesto reach an absorbing state. The time: is 
a random quantity, the mathematical expectation of which is 
evaluated in terms of the quantity q. 

Indeed, if the initial state is s,, then the mathematical expec- 
tation 


Mz = §i (g(t) 9, ¢—1)) 
=] 
: co (v-+1) fo 


<Yra-a¢—n<¥Y YY ta—qo+n) 


t=1 y=1 t=vfp+1 


Kt HOt =F. 


yal 


The sum of the last series is calculated by differentiating 
term-by-term the geometrical progression, whence the result 


Mz < 22 (1.54) 
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is obtained. It is clear from(1.54) thatthe mean time of transition 
into an absorbing state is greater when the probability of transition 
into the absorbing state is smaller. 

All algorithms for solving problems by the Monte Carlo 
method which have been considered above and will be discussed 
in the remaining chapters of this book can be described in the 
following manner. 

Some terminating Markov chain is being simulated. The process 
of successive transitions of this chain from state to state takes 
place in accordance with the scheme 


X—~ 8), —> 8), > 84, > 0 > Sip (1.55) 


where s,, is one of the absorbing states. A certain function F(X) of 
the sequence of transitions (1.55) is determined thereby. The func- 
tion F(X) is a random quantity. After the value of F(X)has been 
found, the system S returns to the initial state s,, and the process 
of transitions begins anew, Altogether NV independent runs of the 
given Markov chain from the state s, into one of the absorbing 
states are carried out, As a result the sum 


WFO). (1.56) 
x 


taken over all the s2quences of transitions (1.55) which have been 
carried out, is obtained. The sum (1.56) approaches the quantity 
M F(X) whose value is required in the given problem, The total 
time required for solution of the problem is 


TaN (M1 Ty (1.57) 


where Mt is the mathematical expectation of the number of transi- 
tions in the sequence (1,55), while 7,isthe mean time for carrying 
out one of the transitions in the computer. 

Thus in the setup for the solution of a system of linear 
equations, which was considered in Section 4, the fundamental 
Markov chain has n-++ 1 states, The states correspondto divisions 
of a closed interval. The probability of the transition from the 
ith state to the jth is P,, (withi<a, j<n). The probability of 
the transition from the ith state to an absorbing one is p;. The 
initial state is denoted by the number m. The function F(X) is 
defined as 


F(X) Pret Piiigs Pg_atatigy 


In the setup for the random walk problem (Section 5), the states 
of a Markov chain are identified with thenodesofa lattice. The 
passage of a wandering particle from node tonode corresponds to 
the transition of the system from state to state, The boundary 
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nodes correspond to the absorbing states of the chain, since no 
more transitions take place after the point has arrived at the 
boundary. The initial state corresponds to the node from which 
the random walk is commenced. The function F(X) in this case 
depends only on the number of the absorbing state (a boundary 
site), and not on the history of the random walk. One could use 
the relationship (1.54) for the evaluation of the wandering time, 
but in the given case a better method was derived in Section 5, It 
is clear that the lifetime of the system depends on the dimensions 
of the region. If the initial point lies far from the boundary, then 
only after many steps can the walk terminate at the boundary. 

In the case where a problem with initial and boundary condi- 
tions (for example, the equation of heat conduction) is being 
solved, the states of the Markov chain correspond to the nodes of 
a space-time lattice. In other words, the state of the Markov 
chain is defined by the pair (P, t), where P is the node of the space 
lattice, and ¢ is the time elapsed since the commencement of 
the random walk, 

Pairs such that P is a boundary node, and also pairs such that 
t=—k* ,are absorbing states, That is, after & steps the process is 
terminated if the wandering point has not reached the boundary 
earlier, In this case also F(X)isdetermined by the number of the 
absorbing state alone, and not by the history of transitions. 

Side by side with terminating Markov chains, in the Monte 
Carlo method it is also possible to use chains of another kind, the 
so-called ergodic Markov chains (see [52]). Ergodic Markov chains 
do not possess absorbing states, and for any pair of states s, and 
s; there exists a number of steps & such that the probability of 
transition from s, to s, in é steps is different from zero. In this 
case there exists, for each state s,,alimiting probability p, of the 
system passing into this state. This means that for a sufficiently 
large number of steps N thenumber of cases M, where the system 
is found in the state s, satisfies the condition 


M 
WP (1.58) 


independently of the initial state of the system. The larger N is, 
the more accurately condition (1.58) is fulfilled. In other 
words, (1.58) may be considered as a generalized version of the 
law of large numbers for the case of dependent trials. A more 
commonly found version of this law is as follows. Let ®(s) be a 


*If the value vu, (P) is being sought, 
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function depending on the state of the Markov chain* and let the 
history of the process for N steps be described by the sequence 
of transitions 


X—~ Sj, > Sj, > Sjg > 02 > Spy 


Then with probability arbitrarily close to unity, for a sufficiently 
large N, we have the approximate equality 


N 
DY 0(s,) + op. (1.59) 
k=l 
The relationships (1.58) and (1.59) make possible the use of 
ergodic Markov chains for solving problems by the Monte Carlo 
method. (Obviously, in practice it is possible to take only a finite 
sequence of transitions, fixing the quantity N in advance.) 

An example of the use of ergodic Markov chains is obtained if 
we consider the problem of the random walk in which, when the 
point reaches the boundary, the process does not terminate, but 
the point is ‘‘reflected’’ from the boundary,** 

In contrast to that considered in Section 5, this version of the 
random walk process is connected not with the problem of Dirichlet 
but with that of Neumann (the second boundary value problem), This 
problem consists of finding the solution of the equation Au—0, 


a 
to a normal vector at the boundary is zero). In order to determine 
approximately the value of the required function u(P) at some 
interior node P, it is necessary to carry out the random walk 
process which has been described, and determine how many times 
the wandering point passes through the node P, If the number of 
‘hits’? is MP, and the total number of steps (transitions) is N, 


then MP mwu(P). 


It will be observed that it is possible at the same time to note 
the number of passages through several nodes, andthus determine 
MP at once for several P, Wenote that the second boundary value 
problem defines the required function u(P) with an accuracy up to 
a constant. The method considered determines the function u(P) 
which satisfies the additional condition 


which satisfies the condition at = 0 (the derivative with respect 
T 


*It is possible to consider also certain classes of functions which depend on transi- 
tion sequences, 
**That is to say, in the interpretation suggested in Section 5, the city is surrounded 
by a wall instead of a moat; on stumbling against it the man is turned back 
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BsP)= 1, (1.60) 


where the sum is taken over all the interior nodes of the lattice. 
The condition (1.60) follows from the obvious condition 


P=, 


where p, are the limiting probabilities for transitions into the 
states of the Markov chain (i.e., for the wandering point to pass 
through the interior nodes of the lattice). 

The example presented above indicates that ergodic Markov 
chains also can be used for statistical simulation, just as can 
terminating chains, although at present the latter are used in the 
overwhelming majority of cases. 

We note that by using a digital computer we cannot in principle 
obtain anything beyond the scope of finite Markov processes, 
Therefore, if there arises a need to simulate a continuous 
stationary process, for example, then this will be successful only 
insofar as the process can well be approximated by a Markov 
chain. 

Indeed, the operation of a computer solving the usual type of 
problem can be depicted as a process of successive transitions 
of a finite automatic machine from state to state, 

At each step the state of the machine is determined by the 
contents of the registers of the machine and of the locations of the 
memory. For simplicity, we shall suppose that after the entry 
of the initial data into the machine no new information is given to 
the machine. Then the state of the machine at the following step 
is fully determined by its previous state. Now let the machine 
receive information at each step from the random-number 
generator. Then the next state of the machine is determined not 
only by the preceding value, but also bya certain random quantity. 
Thus the probability of transition into one new state or another is 
determined by the preceding state and by the probability distribu- 
tion of a random quantity. Since usually the values given by the 
random-number generator at each step are independent, then the 
transition probabilities are determined only by the preceding 
states, and not by the past history of the process. 

If we could simulate in the random-number generator compli- 
cated processes with an after-effect, then the possibilities of the 
machine would be extended. At present such an after-effect in 
random-number generators, in view of its irregularity, isa 
troublesome phenomenon which has to be avoided. 

It was mentioned in Section 1 that the most successful (and 
most common) applications of the Monte Carlo method are to 
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those problems which by their very nature are connected with 
stochastic processes, The most important part of the solution in 
this case consists of the choice of a finite Markov chain which 
is a reasonable approximation to the process. As a rule, this 
problem is reduced to the approximate representation of some 
very complicated Markov chain by a simpler Markov chain. In 
other words, there is a system Switha set of states{s,, s,.... &], 
a transition probability matrix (p,,;) and a function F(X) depending 
on the transition sequence. It is required to form a system S* 
with a smaller set of states sj, s},..., s}, a simpler transition 
probability (p; i) a more easily calculated function F*(X*) and 
(as far as possible) a shorter mean lifetime M*’ (for a terminating 
chain), so that 


MF? (X*) = MF (X). (1.61) 


We shall consider the basic methods underlying such an 
approximation, 

The first method consists of uniting some of the states. na 
number of cases it is expedient to unite certain states of the 
system S, In this case the states of the system S* are classes of 
states of the system S, This is possible if 

a) the function F(X)varies only slightly, ifinthe sequence X of 
states individual states are replaced by states belonging to the 
same Class; 

b) the probability p,, that the state s, passes into one of the 
states of a given class K is nearly the same for all states s, of 
one and the same class H. It is then possible to set p,, =~ p,,, the 
probability of the transition from class H into class K,* 

Problems concerning particles passing through matter may 
serve as an example of sucha unitingprocess, Here the state ofa 
particle is specified by the three position coordinates and velocity 
vectors (x, y, Z. U,, Uy, Uz), However, in a number of problems, as, 
for example, where particles pass through a flat layer, states 
with the same values of x and 7, can be taken as equivalent 
(and thus can be united in a single class), 

The second method consists of replacing the probability matrix 
(py) by one similar to it, in which very small values p,, are re~ 
placed by zeros. 


* It is also possible to take 


where the averaging process is carried out over the r states forming the class H, 
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The third method involves an alteration of the time scale. In 
this method the system S is replaced by a system S* having the 
same states, but considered only at time instants that are 
multiples of v: 


Q, v, 2,06 


In this case the transition matrix (p,,)is replaced by the vth power 
of the same matrix. 

The fourth method consists of approximating the function F (X) 
by a function F*(X*), For example, in the solution of the Dirichlet 
problem (see Section 5) the function f (Q) atthe boundary nodes can 
be replaced by a polynomial which approximates it in the mean. 

There are also certain more complicated methods which in 
essence are combinations of those just described. 

As yet there exists no general theory permitting effective 
approximation of a complicated process by means of a finite 
Markov chain as is done for the case of one-dimensional integrals 
by means of the well-known quadrature formulas. Normally, 
specific methods of simplification are used for each class of 
problem, To a large extent these methods form the contents 
of Chapters II and VI, 


Chapter Il 
Evaluation of Definite Integrals 


The evaluation of multiple integrals is one ofthe more impor- 
tant fields of application of the Monte Carlo method, A large 
number of quadrature formulas are available for the evaluation 
of single integrals, However, the number of such formulas which 
may be used for multidimensional integrals is relatively small. 
Moreover, the actual calculation involved in such quadratures is 
rather difficult, The integrals can often be more conveniently 
evaluated by the less accurate though simpler devices of the Monte 
Carlo method, 


1, SIMPLE APPLICATIONS OF THE MONTE 
CARLO METHOD 


In this section we shall be concerned with two simple applica- 
tions of the Monte Carlo method which may be used to evaluate 
integrals of the form 


J= f reac. 


In the first approach the average value of f (x)is calculated, and in 
the second use is made of the geometrical interpretation of the 
integral, which is looked upon as an area. 

Methods of estimating the accuracy and efficiency ofthe Monte 
Carlo method will also be given, 


1, Calculation of the Mean Value of a Function 
Let § be arandom quantity whichisuniformly distributed in the 
interval (a, 6) with a probability density p; (x) defined by 
1 
B(x)=y ome" 
0 in the opposite case. 


if x is in (a, 8), 
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The mathematical expectation of / (2) is given by 


6 
Mre@= ff) para 


Suppose that as a result of N trials the values of § obtained were 
Ei, Gg ..-1 &y. Since for large N 


N 
Mio~y DS e 
i=l 
it follows that the quantity 


N 
= 
=" Vere (2.1) 


i=1 


is an approximate estimate of the integral J, 


Fig. 3, 


In practice, the ¢, are first used to calculate/(:,), the sum 
fE)+7(&)-+-.-. +f is then evaluated, and after the completion of 
all the N trials, 6, is determined fromthe above formula, 


2. Area Calculations 


For simplicity, consider a bounded integrand 0< f(x)<c and 
suppose that (, 7) is a random point whichis uniformly distributed 
in the rectangle R 


R=lacx<cb 0<cy<e}. 
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The corresponding probability density p,,(x, y) 1s given by 


aos if (x, y) belongs to R, 


&, y= 
“s 0 in the opposite case. 
Suppose that the following N points belonging to R have been found: 
Ne 1)» Eo» Th) +--+ (Ey Ny). It is geometrically obvious (Fig. 3) that 
if N’ of these points lie under the curve y= f(x), then the ratio of 
the areas ABDC: ABFE~N':N, or 


6 
fre dx 
ee —__ 
c(o—a) N- 


The following quantity may therefore be taken as an approximate 
estimate of the integral: 


0 =c(b—a) (2.2) 


In practice, the condition 7, < f(é,) is checked for each point 
(; 4): if this condition is satisfied then unity is added to the N’ 
counter; if it is not satisfied, the count remains unchanged, The 
quantity §, is evaluated after all the Ntrials have been completed, 

It will be shown later that the estimator (2,2) is, as a rule, not 
as good as (2.1), Nevertheless, many authors (for example, 
Householder [63] and Kitov and Krinitskii [39]) use only (2.2) for 
the evaluation of integrals by the Monte Carlo method. 


3, Statistical Estimate of the Accuracy 


The accuracy of the Monte Carlo method was considered in 
Section 2 of Chapter I, 

In what follows we shall assume that the simulated random 
quantity 0 follows the almost-normal distribution 


P(0<xj}~o(2), 


¥ DO 
where (x) is given by 


f? 
O()= T= fe Fat, 


and can be used to estimate the probability of deviation of 6 from 
the mathematical expectation Mé. 
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Consider an arbitrary degree of reliability p and let x, be the 
root of the equation 
20(x,)—1l=—p. 
The probability of the inequality 


hg = Me | < Xp VD6 
is approximately equal to p, 

The value x, = 3, which corresponds to p > 0.99, is frequently 
recommended, However, in practice, this is found to lead to an 
overestimate of the amount by which @ differs from Mé, and 
therefore many authors use p=0,.95,i.e., a 95% level of reliability, 
which corresponds to x, 1.96, 

The probable error is defined by 


Bnrob = 0.675 YDB. 


The value x,—0.675 corresponds to a 50% level (p==0.5) so 
that the probabilities of the inequalities 


}6—Mb|< Sprob and |8—Mé|> bprob 
are equal, The probable error 8,;op is often used in practice to 


characterize the order of magnitude of the error. The standard 
error 


o=VD0 


is also used for this purpose, 
The majority of the estimates used in evaluating integrals by 
the Monte Carlo method take the form of the arithmetic mean 


1 N 
=" 
i=l 
of similar random quantities (, whose mathematical expectation is 
equal to the required value of the integral: 
MC, = J. 


Let Dt, =Dt. The mathematical expectation and variance of the 
random quantity 6 are given by 


Mi=J/, Dé=—, © respectively. 
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It follows from the central limit theorem of the theory of 
probability [23] that for large N the distribution of 4 is approxi- 
mately normal. Hence, with a probability approximately equal to 


p, we have 
V D 
joB—Jl|<~, NW 


In practice it is very important that it should be possible to 
estimate the variance D® easily in the course of the calculation 
of 6. To achieve this, it is sufficient to compute the square of 
(, in addition to (j itself: 


N 
aed 2 2 


Since a high accuracy is not required for Do (it is sufficient to 
have one or one and a half significant figures), it follows that for 
N > 30 it is possible to use the simpler formula 


iz cae 
= 2 
Day Yt (ds) . 


4, The Accuracy of Simple Methods 
Two different methods of evaluating the integral 


J= f separ 


were given in paragraphs 1 and 2, These methods led to the 
estimates 6, and 6,. The accuracy ofthese estimates can be com- 
pared by evaluating their variances, 

The estimate given by (2.1) is the arithmetic mean of similar 
quantities ¢ —(b— a) f (). Hence, 


Dew 
where 
6 
po = (o— a) f f(x) dx — J? 


The estimate given by (2.2) can also be looked upon as the 
arithmetic mean of similar quantities ¢°) —c(b—a)g(é, 7) where 
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»={5 if yx f(x), 
EB M=lo, if yd s(x). 


Hence, 


where* 
pt” — c(b—a)J—J2. 
Since by definition 0< f(x) <c, it follows that 


& 
f fr(x)ax cel. 
a 


Hence, in all cases 
pr < pe, 


Example, Evaluate the integral 
1 
i= f e* dx. 
9 


The exact value of this integral is known to be 
J=e—1] = 1.71828... 


Using the estimate (2.1) we have the following approximation to J 


N 
J, —= 1 iy el, 
i=l 


where 11, {.;--.are values ofa random quantity which is uniformly 
distributed in the interval (0,1). The variance of the averaged 
quantity is 


2] 


pe = +(e —1)—(e— 1)? = 0.2420. 


wane same result can easily be obtained when a binomial distribution is assumed 
for t'"’. 
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The estimate (2.2) can also be used to evaluate the integral by 
setting c—e. This yields* 


N 
A= Yeu Toi) 
where 
1 for ey<e*, 
8 Y=l0 for ey>e*. 


The variance of the averaged quantity is 
De® = eJ—J? = 1.7183. 


Numerical details of this calculation are given at the end of 
Section 2. 


5. Estimate of the Efficiency 


Consider any two ways of evaluating the integral J by the Monte 
Carlo method, It is natural to assume that the more efficient 
method is that which will allow ustoreach a given accuracy more 
rapidly. 

Let 6’ and 6” be the estimates obtained by the two methods, so 
that 


Mo’ = Mo” =v: 


i250 ely. ieee 
Di ==>; Di’ =. 


Let v’ and «” be the timesnecessaryto calculate the quantities 
tv’ and ¢’, Since the two methods are only to be compared, it may 
be assumed that +t isin somearbitrary units. The times necessary 
to achieve a given accuracy are proportional to the products 
c’-DO’ and -”.Dpt". It follows that the efficiency of the method is 
inversely proportional to the producttDt. The quantity < (and also 
D‘) may be estimated empirically froma small number of trials. It 
was shown in paragraph 4 that 


pe? <pe® 
and hence, §, is, as a rule, a better estimate than 4,. 


However, it may turn out thats, <t, (for example, if the func- 
tion y= f(x) is defined by an algebraic equation F(x, y)=0 for 


*g(loz-y To;)can be used instead of g(4» Ty4;)e The quantity 4 is uniformly dis- 
tributed in the interval (0, e) and is obtained by multiplying 7 by e. 
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which an explicit solution cannot be found); the method involving 
(2,2) will then be more convenient than that involving (2.1). 


2, SOME METHODS FOR REDUCING THE VARIANCE 
The rate of convergence of the ordinary Monte Carlo method is 


relatively low and is of the order of N72, In order to increase the 
accuracy by a factor of 10, the number N, i.e., the volume of the 
calculations, must be increased by a factor of 100. It is clear that 
an appreciable increase in the accuracy cannot be obtained in this 
way. The following methods for reducing the variance, and thus 
improving the accuracy without increasing the number of trials, 
are therefore of particular importance. 


6. Separation of the Principal Part 


Simple Monte Carlo methods are subject to relatively large 
errors, It is therefore very convenient, whenever possible, to 
determine the main part of the final result by ordinary methods and 
use the Monte Carlo method to estimate the correction, This 
principle holds for all problems solved by the Monte Carlo method, 

Consider its application to the evaluation of the integral 


J= f reoax 


and suppose that we have found a function g(x) = f (x) whose integral 
is known, so that 


f eee. 


Since the mathematical expectation of the function 


& —@—al[f@®—g@)4+/ 


is equal to J as before, it follows that it may be approximately 
estimated from 


N 
== Vis@—e@+s (2.3) 


i=l 


where §, &, ... are values of a random quantity § which is uni- 
formly distributed in (a, 8). 
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The variance is given by 
b 
Dt® = (6—a) f If (x)—g (xP dx —U—1) 


and when |g (x)— f (x)| 0, then D¢” +0, 

It is particularly useful to separate out the principal part when 
the Monte Carlo method is used to evaluate areas (or volumes) 
with the aid of (2.2). 

Suppose that the area ASDC in Fig. 4 1s equal to J, and let the 
areas of the rectangles ABFE and ABB’A’ be equal to S and S’, 
respectively. 


Fig. 4. 


By choosing random points in ABFE, and counting the number 
of them which lie in ABDC, we obtain for the variance 


pt—=s(s—J). 


If, on the other hand, the random points are selected in A’B’FE, and 
the number of them which lie in A’B’DC is determined, then another 
value is obtained for the variance: 


pt’ =V—S’)(S—J). 
Example. Evaluate the integral 


1 


Jas f edz. 
0 
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Since e*—1-+-x-+ ..., we shall take g(x) =1-+x as the principal 
part, According to (2.3) we have 


N 
1 
R=L DY (ry) +4, 
t=1 
where 1,, yj, ... are the values of a random quantity which is 


uniformly distributed in (0,1). The variance of the averaged 
quantity is then given by 


pw =te —1)(5—e) a 0.0437. 


It was shown earlier that when this integral is evaluated with 
the aid of (2,1) and (2.2), the corresponding variances are 0.2420 
and 1.7183, respectively. Numerical details for the above example 
are given in paragraph 11. 


7, Selective Sampling 


This method is based on a variable sampling procedure [173- 
175, 38, 193] in which a larger number of points is selected in the 
‘‘more important’’ parts of the region of integration, 

Let § be a random quantity with a probability densityp(x) >0, 

5 


a<x< 5b, which is normalized so that J p(x)dx =1, The required 
a 


integral may be rewritten in the form 


b 6 
j= f s@ax= LS p(x)de. 


Consider the function 


) _ F®) 


P() 


ci 


whose mathematical expectation is 
Mc) = J. 


It follows that the required integral may be estimated from 


Opa 1 (i) (2.4) 
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where {,, &,... are the valuesofa random quantity &. In particu- 
lar, when ¢ is uniformly distributed in the interval (a, 4) then 
p(x) =~ and (2.4) becomes identical with (2.1), 

The variance of the function ¢“ is 
b 
@.. | Ls. ss 
Dt =f Ee) ae — 7, 


The problem now arises as to how to select tin order to ensure 
that the variance will be a minimum, 

The minimum Dt” is achieved when the random quantity «= is 
distributed in (a, 6) with a density 


If()| 
4 
f isGoiax 


p(x) = 


In this case, 


B 


2 
pe = | iy Iyer —s, 


and if the function f(x) does not change sign then Dt” = 0, 
In actual fact the use of the random quantity | is pointless since 
h 


this would involve the evaluation of the integral [{ | f(x)|dx, which 
a 


is practically equivalent to the evaluation of / (when p(x) does not 
change sign the two are exactly equivalent), 

The practical conclusion is: the quantity — shouldbe chosen so 
that the ratio of f(x) to |/(«)| variesas little as possible, i,e., 


p(x) 


Trt = const. 


Example, Evaluate the integral 
1 


Jase [oF de, 


0 


Since e*=1-+4«-+.,,, let us consider a random quantity witha 
density p(s (1-+- x). According to (2.4) we have 
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The variance of the averaged quantity is then given by 


i 


_ 3 e* dx ae 
DY => its a 
0 


4 e-? [Bi (4) — Ei (2)] —(e — 1)? = 0.0269. * 


The values of § may be obtained from the formula 
ge VI-+37—1, 


where y isuniformly distributedin(0, 1). Numerical details of this 
calculation are given in paragraph 11. 
Remark, Suppose it is required to evaluate the integral 


6 
J= ff) pax, 


where p(x) is the probability density ofa random quantity. Here 
also it is possible to use selective sampling: instead of J/= Mf (6)it 
is possible to estimate 


jam {Feo |, 
P (1) 


where »-is a random quantity with probability density p(x). 


8. Group Sampling 


This method is well known in statistics [83], It is similar in 
principal to the method of selective sampling (more random points 
are selected in the ‘‘more important’’ regions), However, when the 
Monte Carlo method is used, selective sampling is more fre= 
quently employed. 

Let us divide the interval (a. 5) into m intervals (a,, 6,) with 
lengths /,(k—=1, 2,..., m3; Fig. 5): 


=4,<)=&<h=a,< ere < bn -1 = 8g <b = 5; 
ae ee eee 


*El (x) is an exponential integral function, See Matematicheskii analiz [Mathematical 
Analysis] (functions, limits, series, continued fractions), Fizmatgiz, 1961, p, 379, 
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> —+_ ++ 
G=a}, — byda,bea3, --- Dm=b 
Fig. 5. 


Each of the integrals following the summation sign in 


frow=J frees 


k=l a, 


will be evaluated by the simple Monte Carlo method (2.1), using 
N, values of quantity * which is uniformly distributed in (a,, 4,). 
This yields the following estimate 


4=> $7 (H", (2.5) 


k=1 * ie 
It is easy to show that 
m 72 
ED 
Mi;=J. Do= >) a 


k=1 


where D, = Df (#).* 
The total number of values of f (x) whichare used in the calcu~ 
lations is 


N=N, ANH os. +N 


It is easy to show that the minimum of the variance Dé, is 


reached when the N,=WN,|are proportional to 1,p}. In the latter 
case 


i Sar 


However, the values of D,areasarule unknown at the beginning of 
the calculation. The quantities N, are therefore frequently taken to 
be proportional to /,: 


*It is clear that 


2 
af revel —— Lr e 
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Me 1,N 
N,= b—a . 

In that case 


It is clear from the inequality 


Di; < D4; < DA, 


that in both of the above two cases group sampling is more efficient 
than the estimate 6, (realization time t+, =~ *;), 

It is obvious that the region of integration can be conveniently 
divided into parts in each of which theintegrand varies slowly, In 
the multidimensional case it is often sensible to approximate the 
boundaries of these parts by some simple function g(x) which has 
properties similar to the integrand. 

Example, Evaluate the integral 


1 
= “dx. 
J Mi x 


Let us divide (0, 1) into two equal parts and select four points in 
(0, 4/9) and six points in (1/2, 1), This yields 


4 6 
Iw 1 3/2) 
h=z ye’ tag De’ 
i iz] 


f=1 


and the variance is given by 


where 


ip. Ye 2 
Di= 2 femax—|2 fovas] = 
) i] 


e—1—4(Ye— 1) = 0.03492; 


1 1 2 
daa ftar—|2 fetae| = 


Ya ‘ls 
e®? —e —4(e— Ve) = 0.09493, 


2. SOME METHODS FOR REDUCING THE VARIANCE 53 


It follows that DJ, = 0.006138 and 2prop= 0.675(0.00614)'/2 = 0.053. 
& and ¢? may be calculated from the values of 1: 


0.57, £° = 0.5(1+ 7). 


Numerical details of this calculation are given in paragraph 11, 


9, Symmetrization of the Integrand 
Suppose it is required to evaluate 


i 
J= f f(x)dx. 
0 


We shall illustrate with examples how information about the 
behavior of f(x) may be used to reduce the sampling variance, 

Suppose that it is known that f(x) is monotonic (or approxi- 
mately monotonic), It may be expected that the symmetrized 
function 


f(O=SU+SU— 2 


will vary more slowly than f(x) (Fig. 6) and hence the variance of 


OSU M+SI 


will be smaller than Dt'(7). The corresponding estimate of the 
integral is 


N 


=a DU +S wh (2.6) 


i=l 


where 7, j, --- are the values of a random quantity 7 which is 
uniformly distributed in (0, 1). It is easy to show that 


p< pt. 
However, it does not yet follow that (2.6) is always more convenient 
than (2.1). The time for the calculation using (2.6) is approxi- 
mately twice as long as the time for (2,1), i.e., 


tga Qt. 
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Fig. 6, 


Hence (2.6) is more efficient than (2.1) provided 
pe? 0.5 pc? 


or 
1 
ff@Mfd—xae ce, 
0 


Example, Evaluate the integral 
1 


Jax f edz. 


From (2.6) we have 


The variance of the averaged function (Fig. 6) is 
De = 4 [2e-+ (e — 1) (5 —3¢)] = 0.00392, 


Numerical details of this calculation are given in paragraph 11, 

Suppose now that the function f(x) is known to have a single 
extremum in the neighborhood ofthe point x =0.5, The use of (2.6) 
would only complicate the situation. However, it may be expected 
that the function 
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e=3[f (3) +7(' -F)]- 


whose integral is equal to / as before, will be monotonic (Fig. 7) 
and can then be symmetrized in accordance with (2.6). 


Fig. 7. 


Thus 
no=a[s (5) (1-3) +15 +4) 472-4): 
and the corresponding estimate of the integral J is 


N 
naar SL (H+ 41—4)+ m 


This estimate is more efficient than (2.1) when Dt" <0.25 Dt”. 
Example. Evaluate the integral 
1 


J= [ sinzx dx 
* Tv 


0 


with the aid of (2.7), Since in this example (Fig. 7) 


f+ — 4 = 0.003871, 
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while 


1 
2\2 ] 4 
DO = } ea ee (2) => — = = 0.09472, 
0 


so that Dt”: Dt” —24.5, Hence, in this case the estimate given by 
(2.7) is more efficient than that given by (2.1) by a factor of 6,1. 

In contrast to the methods described in paragraphs 6-8, the 
application of symmetrization to multidimensional cases is not so 
convenient, It requires a preliminary transformation of the region 
of integration into a unit cube (corresponding to the number of 
dimensions), 

In particular, in a three-dimensional unit cube the analog of 
the function 


f(O= FU M+F0—2] 


contains 23= 8 terms: 


felt Ys V=ZU(H Y. D+ LAH 9, DASH 1-y. 2) 


f(x, y, 1—z)+ f(l—x, 1—-y, 2) + f(l—x, y, 1-2) + 
f(x, 1—y, l—z+f(U—+x, l—y, 1—2)]. 


Some methods of symmetrization have been given by Morton 
[208] and Bakhvalov [3], 


10. The Use of Dependent Quantities 


This method is usually confined to estimates of the form of 
(2.3), (2.6) or (2.7), which may be looked upon as special cases of 
dependent quantities, 

Suppose it is required to evaluate the integral 


b 
S= ff (*) p. (dx, 


where p, (x) is the probability density ofthe random quantity § in the 
interval (a, 6), In other words Mf (!)—J. Consider now a random 
quantity 7 and let p;,(x, y) be the probability density for the point 
(é, 4) so that 


6 
P= f Peg ay, P= f Py Wax 
a 
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(the range of variation of 7 will not be indicated), It is not assumed 
that § and y are independent, so that p,, (x, y) # p;(*) P,(y)- Consider 
now the random quantity 


c=fG)—sn 
where g(y) is also arbitrary. If /=Mg(v), then 
MC=J—ZJ. 
Thus, 
Dt=Df+Dg—2r VDF: De, (2.8) 


where r is the correlation coefficient, 


pa MAF OJ ey —M 
V DF @)-De (a) 


It is known [23] that |r| <1. When £ and y are independent then 
r=0, 

A, Consider now y and g(7) and suppose thatthe mathematical 
expectation / is known, Since 


J=MC+/, 


it is sufficient to estimate Mt in order to determine J. It is clear 
from (2.8) that DC will be small for large positive values of r (the 
absolute minimum D¢—0 is reached when y7=§ andg=f). It fol- 
lows that the sampling of ¢ is more convenient than the sampling 
of — when g(x) and f(:) have a large positive correlation, In par- 
ticular, it is convenient when 7—§& and g(x) = f (x),as was the case 
in paragraph 6, 

B. Let us now take 7 andg (7) sothat /—=—vJ, Here an estimate 
of M¢ will also yield / since 


J=>M. 


Again, the use of ¢ is more convenient than the use of § when g(x) 
and f(&) have a large positive correlation. 

For example, if p,(x)=1, as was the case in paragraph 9, then 
we can have y=é and g(y)=— f(l1—y). 

Further development of these methods has led to ‘‘random’”’ 
quadrature formulas, In fact, the function 


=i Seria dist 
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which enters into (2,7) is a quadrature formula for each£, More 
complicated functions of the form 


m 
C= 2 of i) 
can also be used, where hf, (!) are linear functions of & Such esti- 
mates have been investigated by Hammersley and Morton [149] and 
Hammersley and Mauldon [147],* It may be noted that for suf- 
ficiently smooth functions the convergence with respect to m is 
more rapid than the convergence with respect to N, and itis 
convenient to use large m and small N (evenN=1), 

Morton [208] and Bakhvalov [8] have developed analogous 
estimates for the multidimensional case. So far, they have not 
been of great practical importance because of their complexity [11], 

Another approach to the construction of random quadrature 
formulas has been adopted by Ermakovy and Zolotukhin [35], 


11, Numerical Example 
Various methods of evaluating the integral 


1 
J= fe dx = 1.718. 
0 


have been considered above. Here, this integral will be evaluated 
using all the above six methods with N=10,** The values of the 
random quantity 7 which is uniformly distributed in (0, 1) were 
taken from Table I of the Appendix, The values of random 
quantities with other distribution laws are obtained by transforming 
these values of 7. 

The formulas for the evaluation of / are as follows 


lwhen ey < e*; 


i= » Tw+s Where g(x, y)= 
2 10 & (7 T1042) &( y) Owhen ey>e*; 


*In particular, the methods A and B above are sometimes called the control variate 
and the antithetic variate methods, respectively. 
**It is usually assumed in statistics that the central limit theorem holds for N > 30, 
so when N = 10 the estimate of the statistical error may be rather different from the 
true value. N =10 was chosen in order to simplify the calculations, 


2. SOME METHODS FOR REDUCING THE VARIANCE 59 


=m Te Where t, = V1+ 37,—15 


4 6 
2(1) (2) ' 1 
4s =" +m de * ,where =z ee ant ’ 


The results of the calculationsaregivenin Table 2, The values 
of the probable errors are also given for comparison. It is clear 
from the table that the actual errors areof the order of 8y,0p- 


Table 2 


1.798 | 1.782 


0.080 | 0.064 
0.044 | 0.035 


Table 3 may be used to estimate the efficiency of all the esti- 
mates J, in the case of the ‘‘Strela’’ computer (average speed of 
3000 operations per second); «, is the time necessary to evaluate 
J, with the aid of this computer in milliseconds, 


The progress of the calculations can easily be followed from 
Table 4, which also gives the intermediate quantities, 
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12, Simple Methods 


All the methods and estimates of Section 2 can easily be 
extended to multidimensional integrals of the form 


f f(P)aP, (2.9) 
CG 


where G is an arbitrary region of a d-dimensional space, the 
points P=(x,,..., x4) belong to the region G, and dP = dx, dx,...dx4. 

Example, Some of the integrals which are encountered in 
mathematical physics (potential theory, scattering theory, and so 
on) may be reduced to the form 


i=4.f f roarag 


s § 


where the points Pand Q lie within the unit sphere S and p is the 
distance between them: 


p=|P—Q|. 
This is, in fact, a triple integral, However, it may be regarded 
as a sextuple integral and the symmetry may be taken into account 
during the realization of the random trials, 


Let P and Q be two independent random points which are uni- 
formly distributed in the sphere S, i.e., their densities inside S are 


given by 
pP)=p@=(hr). 


In terms of spherical coordinates (r, ¢, »), where pcos 4 (Fig. 8), 
the volume element is given by dP=r?drdgdy. Inview of the sym- 
metry of the problem the point P may be chosen to lie on the Oz 
axis so that 


r 


p 
3 f rdr= 
i) 


The point Q may be chosen to lie in the Oxz plane (9 =6) so that 


¥@ 


r 

Q 
f t= 3f Pdr = 15 
-1 0 
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Fig. 8. 


The resulting formulas used in the computation are then 


3-— 3 
eV Te Hg =27,—1, iV te 
pe=—Vit+4—- 220F pl a: 


The approximate value of the integral is 


N 
t= TV re. (2.10) 


j=l 


where p, is the value of p obtained as a result of the ith trial. 


13, Methods of Reducing the Variance 


The following rule must be borne in mind in addition to the 
methods of reducing the variance which were given in Section 2: 
the variance is reduced by analytical integration with respect to 
one of the variables, Thus, suppose that itis required to evaluate 
the integral 


J= f fr. QPP, QaPag, 


Vp Va 


where P and Q are points belonging to the spaces V, and Vo, re- 
spectively, and p(P, Q) is the probability density. Suppose further 
that the integration with respect to Q may be carried out analyt- 
ically, so that the partial probability density 


D(P)= f pP, QaQ 
YQ 
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is known, and so is the function j(P) which is defined by 


FOP = ff. MPP. QaQ. 
a.) 


It is clear that 


J= fF) pear. 
Vp 


This means that the calculation of the average M/ over the density 
p(P, Q)may be replaced by the calculation of the average Mf over 
the density p(P). Hence 


D/—D7= f fPE. OPP. QaPag 


Vp Ve 


— f Pw) pePyaP>do. 
Vp 


This difference is equal to the average value of the variance Dpf 
which characterizes the spread of Q for a fixed P: 


Df— Df = f D pfi (P) dP. 
Vp 


In particular, if the density p(P, Q)is constant in V, x Vg, then the 
density p(P) is constant in Vp. 


14, Integrals With Singularities 


Suppose that the region G is finite and the function f(P)has a 
singularity inside G@ or on the boundary. Itis required to evaluate 
the integral (2.9): 


f far. 
a 


Here, it is recommended that selective sampling should be used, 
with the density p(P) having the same singularity as the integrand 


F(P)- 
The simple Monte Carlo method may be used to evaluate (2,9) 


only when ;. f?(P) dP converges; inthe opposite case D§, = co. How- 
@ 


ever, when the variance D9, is finite, the inclusion of the singularity 
in the density usually leads to a reduction in the variance, 
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If the region G is not bounded, then the integral (2.9) may be 
evaluated as follows: 

1) Parts of the integral which correspond to sufficiently distant 
parts of G are rejected, and the integral is evaluated over the 
remaining finite regions, or 

2) The integral is transformed so that it corresponds to the 
case considered above, or 

3) Selective sampling is used, with the probability density 
decreasing sufficiently rapidly at infinity. 

The last-named method is clearly the most natural. The 
singularity can again be included in the density so that the 
probability density falls off at infinity at the same rate as the 
integrand, 

Example. Suppose that in the integral 


=4f [rar ag 
s Ss 


(cf. paragraph 12 for notation), the integrand has a singularity of 
the formp®, 6 will then yielda poor estimate of the integral, since 
Dé = 6o..% 

Suppose that the random point Pis uniformly distributed in S$ so 
that p(P) =3 (4x). We shall try to find the random point Q which is 
such that its density is proportional to ¢?. 

It is convenient to use spherical coordinates (p, 9, ») with the 
origin at P (Fig. 9). Consider an arbitrary direction p=cos 6 in 
the Oxz plane (gp=0), Let /=J(rp, p)be the distance of the point P 
from the surface of the sphere inthisdirection (PL in Fig, 9), and 
let p be uniformly distributed in the interval (0, /). Under these 
conditions the density is given by** 


I 
P(Q)= Fepil ° 
The computational formulas are then of the form 


°— 
rp=Vi» p= 2-1, 
t=prp,+V1—A0—p), 
P= gl. 


*The integral f fro dP dQ am fee converges, 
Sos $$ 


**A more correct notation is p (Q/P) since this is the conditional probability density, 
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(2.11) 


| where p, and /; are the valuesof p and / obtained as a result of the 
| ith trial. 


15. Numerical Example 


We shall use the estimates § and § obtained in paragraphs 12 
and 14 for the approximate evaluation of the integral 


Fines sd ees 
nn p= 
S us 


for m=1 andm=2. In this integral S is a unit sphere in three- 
dimensional space, and the points P and Q lie within S; p=|P—Q| 
is the distance between P and Q. 

The integral /,, may be evaluated analytically.* In particular, 


32 64 
ar. ly= 3. 


Lh=4. h= 35 


A, Various estimates andtheir variances. Using (2.10) we have 


*When m <2 


‘_o 32 aay 1 
m = —m) (© — m) (ory Qe+ 3’ 
where E (x) is the integral part of x, —_ 
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The corresponding variances are 


More explicitly: 


0 
1 1 t 1 
a ; f / 162 f dQ 
—a- | dr d do = —— | redr | —~-= 
oar My pee Bon, e 


0 s 
162 1 ‘a i dPdQ_ 32 es 
“2° R.2 ee eg ee 
e ° 3 
ss 


~ 1 16 1 16 \2 
a=W Pls th=a[M is) of 4) 

It is interesting to note that D/, > D/,, andhence it is clear that 
if the integrand f(p) in the example of paragraph 14 does not have 
a singularity of the form p~then§ may be a worse estimate than 6. 
It follows that an incorrectly used selective sampling may yield an 
inferior result as compared with the simple estimate @. 

Next, it is easy to see that a statistical determination of p is 
not required for the evaluation of hi In other words, it may be 
considered that integration with respect to p in /, has been carried 
out analytically, An analogous estimate may be obtained for the 
integral /,: 
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In fact, 


It is clear thatD/, < Di. 


67 


B, Calculations, The integrals /, and J, were evaluated by all 
the above methods for N = 10 (cf, footnote onpage 58), The random 
quantities 7 which are uniformly distributed in (0, 1) were taken 


from Table I in the Appendix. 
The formulas employed were as follows: 


3r-— 3 
r= i re=V1 
Pe=2t—1 pb =2t,—1 

3 ——$ $<. 
rg=V 1s L ==prpt Vi—r— p>) 
p =VA+F— worl o p =a 

16 ~ 1 16 ." 
i = 90 J m 1 = 30 oli 

i=l i=l 

16 WI 16 \ 

1 =" ly 2 = 39 ak 


The results are given in Table 5 together with the values of the 


probable errors &prob- 


The intermediate quantities which indicate the progress of the 


calculations are given in Table 6, 
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Table 5 


Result 


C, The values of Table 6 may be used to obtain an empirical 
estimate of the variances of /,and (cf, paragraph 2): 


10 
a 1 1 /16\2 1 = 
Dizzy E (~) YT] =o, 


10 


f=1 


The true values are respectively 0.26 and 0.68, The accuracy of 
the empirical value of D/, is quite sufficient for the estimation of 
8prob: However, the empirical value of D/, is not satisfactory 
because of the low accuracy of /,. 


16, Monte Carlo Method With Increased Rate of Convergence 
Consider the evaluation of the integral 


J= ff) dP, (2.12) 
K 


where K is a unit cube in the d-dimensional space 0< +), x), ..., 

xqa<1, the point P=(x,,..., *,)belongstoX anddP=dx, dx... dx4. 

The simple Monte Carlo estimation given by (2.1) can be used to 

evaluate (2.12) only if the integrand f(P) hasan integrable square, 
1 


The order of convergence is then NV 2, 

Quadrature formulas cannot be constructed for such a wide 
class of functions (cf. paragraph 17), However, quadrature for- 
mulas giving a higher rate of convergence are available for more 
restricted classes of functions. The known properties of the 
functions in a particular class are used to construct such formulas, 
and it is natural to expect that the same information may be used 
to develop Monte Carlo methods giving a more rapid rate of 
convergence, 
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Consider a set of functions for whichall the first order partial 
derivatives 


of of of 


Ox,’ Ox,’ """’ Oxg 


are continuous and bounded in K, There are quadrature formulas 


which ensure convergence of the order of N @ for this class of 
functions, There are also Monte Carlo methods which ensure 


1 

convergence of the order of nG@*2) for the same class of functions.* 
Bakhvalov [3] has shown that these ratesof convergence cannot be 
improved upon, The difference between them is due to the fact that 
in the case of the quadrature formulas it is the maximum error 
for the particular class of functions whichis being estimated, while 
in the case of the Monte Carlo method an estimate is made of the 
probability of the error for one of the functions in the class, 

Asan example ofa quadrature formula ensuring convergence of 


the order of NV 4 we may take the formula 


N 
1 
Izzy > f (P)s (2.13) 
i=l 
in which the nodes P; form a uniform (cubic) lattice in K. This 
means that the cube K is dividedinto N = n* equal cubes K; (Fig. 10) 
and the point P, is taken at the center of K;. 


Fig. 10. 


Consider now the formula given by (2.13), but suppose that the 
point P; is a random point which isuniformly distributed in K;, It 
is easy to show that 


*In each case the integral is estimated using values of the function at N points, 
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N 
eae 


i=l 
Dr Ye} < ae 


where the constant A depends only on d and the maximum of the 


moduli of the partial derivatives* Ge . 
J 
It follows from Chebyshev’s inequality that 


i 


Ad 
Hence, in this case the order of the convergence is vG “a ) 
For eas : this method of calculation is hardly appropriate, 


> 1—e?. 


N 
a yf Pp—s]< A 
j=l 


nN? 4 


since t+ a my . It is better to use the simple formula (2.1), 


The Monte ‘Ceilp method involving increased rate of converg- 
ence has not been widely used in computational practice, It is 
probable that this is due to the fact that the multidimensional 
quadrature formulas which should serve as the basis for such 
Monte Carlo methods are very complicated. 


17. Practical Remarks 


A, Some properties of the Monte Carlo method. The first 
property is the probabilistic nature of the convergence. Although 
1 


the order of convergence, N ?, is nothigh, it is independent of the 
order d of the integral (cf. B, below), Moreover, the probable error 
of the calculations may be estimated in the course of the calcula- 
tions themselves, Ifthe result isrequiredtoa low accuracy (of the 
order of 5%) then this convergence is quite acceptable. 

The second property of the method is its simplicity, which 
facilitates programming fast computers, 

B. On the application of uniform nets. Uniform nets are fre- 
quently used in computational practice to evaluate multidimensional 
integrals over a unit cube X, Suchnetsare simple, and are simple 
generalizations of the procedure used with single (one-dimensional) 
integrals. However, uniform nets are not suitable in the case of 


*A= wey 2 Ld, ee sup | 7 |. 
j 
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large d [56], At first sight this would seem to contradict the state- 
ments made in paragraph 16, where it was shown that uniform 
1 


nets ensure the best order of convergence \N ¢ ) for the class of 
functions having continuous and bounded first-order partial 
derivatives, It is possible, however, to restrict the class of func- 
tions somewhat, and require that all the partial derivatives con- 
taining no more than one differentiation with respect to each 
variable should ibe continuous and bounded. In other words, the 
derivative ye CORREO x f aoe 
be obtained by differentiation should be continuous and bounded. 
For this class of functions it is possible to construct a net in the 


cube K which will ensure convergence of the order of inn ; while 


and all the derivatives from which it can 


1 
the order of the convergence for uniform nets is N ¢, as before, 
For large d the difference is very considerable. 

C, On the application of the Monte Carlo method. Consider 
the following three cases: 

1) The function f(P) is ‘‘sufficiently smooth’”’ and the region G 
is *‘sufficiently good.’’* Such integrals are, in practice, evaluated 
by the Monte Carlo method for d> 4 and occasionally for d> 3. 
However, this is apparently a temporary situation: it is possible 
to construct quadrature formulas for various classes of such 
functions which have a faster rate of convergence. When such 
sufficiently simple and universal formulas are found, the Monte 
Carlo methods will become less convenient (cf, Section 5), 

2) The function f(P) is “piecewise smooth,.’’** The integrals 
of such functions are evaluated by the Monte Carlo method for 
d>2. In fact, in the case ofa function of a single variable, the re- 
gion in which it is defined may be divided into a number of intervals 
in each of which a suitable quadrature formula may be used, How- 
ever, when d = 2 the division of a square into curvilinear figures 
and the transformation of each of them into a standard region for 
which adequate quadrature formulas are available is a complicated 
matter which is very inconvenient in computer practice, On the 
other hand, the computational process in the case of the simple 
Monte Carlo method does not depend on the position of the lines 


*It may be imagined that G maybe easily transformed into a unit d-dimensional 
cube K and the function f (P) has continuous partial derivatives of order up to and in- 
cluding d, 

*kThis includes the case of a "poor"' bounded region G since it can always be in- 
cluded in a cube by adding a null function, 
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of discontinuity, The Monte Carlo method has, so far, been the 
most convenient method for such integrals, 

3) The function f(P)is ‘very poor.’? There are broad classes 
of functions for which quadrature formulas with convergence of 


order better thanN ? cannot be constructed. In the one-dimensional 
case, functions of this class are, for example, those satisfying the 


Lipschitz condition of order «a fora < ot 


f(x) —F (x) |< L| x’ — xf. 


Analogous classes of functions H, for functions of a number of 
variables are given by Sobol’ [60], 
If the only thing which is known about a function /(P) is that it 


belongs to a class whichis broader than H, fora < é (in this sense 


it is “‘very poor’’), then as a rule its integral is best evaluated 
by the Monte Carlo method (even for d= 1), 
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Many important problems in quantum physics and theory of 
probability can be reduced to the evaluation of Wiener* integrals 


f F [x|dwx, (2.14) 


¢ 
where C is the functional space of all functions x — x (t) which are 
continuous in [0, 7] and satisfy the condition x (0)=0, and F [x] is an 
arbitrary (continuous and bounded) functional defined in C. 


18. Two Methods of Evaluating Wiener Integrals 


The integral (2.14) may be evaluated approximately by replacing 
it by an integral of finite dimensions and sufficiently high multi- 
plicity d. Infact, let us divide [0, T] into d equal parts by the points 


OHA lt <i... CH=. (2.15) 


*We shall consider only the integral of Wiener measure, In determining this meas- 
ure, the derivative should be made equal to!/,, which can always be arranged by 
changing the time scale, In this way, the probability density of x (f) is 


en (¥7/f) = 


Ps (x) = rt 
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Each curve x/(f) will be replacedbya broken line x (¢) which inter- 
sects x(f) at all the dividing points: 


x (t) = x (t,) = x;. 


The value of the functional F [x] on such broken lines can be looked 
upon as a function of d variables: 


FP [x] =F (Sys oc0, Qs 
Hence, 
f ria dyx= 
é 
Pe 
d -P ~ “FG i-v 
jim (7) ie f Pee nxae 7 ax,,...dxXq. 


The multidimensional integrals on the right hand side may be 
evaluated by any method, including the Monte Carlo method, 

A somewhat different approach is also possible, Wiener’s 
measure in the space C defines the probability density for the 
trajectories of a Brownian particle satisfying the initial condi- 
tion x (0)=0. Hence, the integral (2.14) represents the mathematical 
expectation (mean value) of the functional F [x] over the complete 
set of such trajectories. For the approximate evaluation of (2.14) 
by the Monte Carlo method itisnecessaryto realize a sufficiently 
large number N of Brownian trajectories x!! (4), x@(#),..., xf, 
find the value of the functional for each of them, and average the 
result; 


N 
f Flx|dwx~ YF lx). 
c 


k=1 


The realization of Brownian trajectories can also be approxi- 
mately carried out by replacing them by broken lines. 


19, Approximate Construction of Brownian Trajectories 


In all the formulas given above, the quantities [,, ¢,,... are 
normal random quantities with a meanof zero and a variance of 1. 
The first method is based directly on the definition of the 
Brownian motion, The interval [0, T]is dividedinto d equal parts 
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by the points defined by (2,15), Thevaluesof x(¢,) are determined 
statistically using the formula 


ca ° 
xt=x¢.+V & ‘sj? $1, Zo sine Gs 
and the points of the trajectories, which are obtained as a result, 
are connected by straight lines, 


The second method [20] is based on the fact that if the values 
of x(s,) and x(s,;) are known, then the conditional distribution of 


x (5) is normal with a mean equal to Sx (s,)-++ x (s,)] and a 
variance of s1a—s |. 


The computational formulas are 


x (T)= +t 

=ti2@+eN+V 24 

(E)q 3[*@+x(z)| fy a ( 
( =4[x(fZ)+<]+- V7 Ze. 
] 

=3 


[s@++(F)]+V o& 


Having determined 2” points of the trajectory, the points are con- 
nected by straight lines. 


20. Numerical Example 
Evaluate the integral 


ftxWawe=4 
c 


where 
a | 
xP f oat. 
0 
Since 


Ditsii= fixitewe- (4~=5—-La=t, 
c . 
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it follows that for N=10 the probable error is 
dprob = 0.675(0.00833) = 0,062 


(in addition to this statistical error, there is also the further 
error due to the replacement of continuous trajectories by the 
broken lines), 

The Brownian trajectories will be constructed by the second 
method using four points and the normal random quantities given 
in Table II of the Appendix, The number of trajectories is N=10 
and the computational formulas are 


x (1) = 0.70711€,; 
x (Ha) = 0.5.x (1) ++ 0.353550; 
x (iq) = 0-5x("/.) + 0.2565; 
X P/4) = 0.5 [x (Ya) x (1)] + 0.25,; 
Nell? = gy ( 2x7 /q) + 2x? (Mp) + 2x? Cy) + 22 (1) + 
% (4) ¥ fo) + ¥ Cfo) x fy) -+ x Ci, (1) }; 


10 
2 1 2 
J lelldwx 5 Dy lll? 
i=l 


The results of the calculations are given in Table 7. The first 
three trajectories are shown in Fig, 11. 


x(t) 
4 


Fig. 11. 


The final result (0.2585) turned out to be exceptionally good (by 
chance), For NV = 9 the result is 0.2075, for which the error is of 
the order of éprob- 


*The following formula was used: If the function f(x) is linear in (4, }) then 


b 
[Pema rot @sO+ron 
a 
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The variance can easily be estimated from data givenin Table 7: 


D { i) xll2} = g (1.4774 — 10 - 0.25852) = 0.090 


The exact value is 75 = 0.083. 


5. APPLICATION OF QUASI-RANDOM POINTS 
TO THE MONTE CARLO SCHEME 


It was noted in Section 3 (paragraph 17) that for “‘good’’ func- 
tions there exist multidimensional quadrature formulas ensuring a 
better rate of convergence than the Monte Carlomethod, A simple 
method for the construction of such formulas will now be given 
141, 60]. This method can be generalized toan infinite number of 
dimensions [61] and preserves the Monte Carlo scheme of calcula- 
tions, except that the random quantities are replaced by quasi- 
random quantities, 

Before we explain the method we must define the number p, (i), 


21, The Numbers p, (i) 


Consider a positive integer r > 2. We shall give three equivalent 
definitions of p,(é). 
Definition 1, If in the number system with scale of notation r 


$= 2,01 +++ AQqs 


then (in the same system) 


P, (i) = 0, Gy Ay... By 14m: 


The quantities a, are the integersO<a,<r—l1, More explicitly: if 
i=agr™ +ag-yr™— +... tayrtay, 
then 
PrQ=2BtA t+ oe toe 
For example, let r= 2, i=11. Since in the binary system 


i= 1011, it follows that p,()) 0.1101, or in the decimal system 
Po(l tt. 
. 16° 
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Definition 2 (recurrent), 
1 
a) p, (1) — -? 
b) If in the number system with scale of notation r 
DP, (i) =0, apa, ..- Ag-14,00 ..., 


then in order to obtain p,(i-+-1)itisnecessary to find the smallest 
number & which is such that a, < r—1;a,is then replaced by 1+ 4, 
and all the figures with smaller numbers (if they exist) are re- 
placed by zeros; figures with numbers greater than k remain 
unaltered, 

This rule may be written in the form 


P,@+1)=p,(+r-*+r-*— 1. 


For example, in the binary system p, (11)=0,1101. In this 
case, k= 2. Hence, p, (12) = 0.0011 or py (12) = 3/16. 
Definition 3 (recurrent over groups), 


a) p,(r*) ==r7"-1 whenn= 0, 1, 2,...; 
b) p,(r?+-/) = p,(r")+ p,(/) when 
J=1,2,..., 20° —r*—1,. 


All the numbers 7,(/) are rational and lie between zero and 
unity. For given r the sequence of points p, (1), p,(2),...,7?,() is 
uniformly distributed in the interval (0, 1) (in the number- 
theoretical sense),* 

Some numerical values are given in Table 8. 


Table 8 


22, Evaluation of Multidimensional Integrals 
Suppose that K is a unit d-dimensional cube as before, 


P=(x, Xg...,%g)is a point in the cube, and dP = dx, dx, ... dxy. Con- 
sider d relatively prime integers rj, ry, ..., 7. 
*The sequence Pp, (1), P2(2),..., P2(i),-.., was apparently first given by van der 


Korput and independently by Sobol” [56]. Generalization to arbitrary r is due to 
Hammersley, 
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Halton’s sequence will be defined as the sequence of points 
Pi, P3, .--, Pj. ... in the cube K whose coordinates are given by 


Pi=(P,, Os Pr, Os «+02 PO): 


This sequence is uniformly distributed in the cube K (in the 
number~theoretical sense), 

In the theorem given below, an estimate is made of the error 
in the quadrature formula 


[ 1@aP~a 1¥ (0). 


i=l 


Definition, A class W,(L) of functions is defined as the set of 
functions f(P) whose partial derivatives containing no more than 
one differentiation with respect to each of the coordinates* are 
piecewise continuous and bounded, so that 


omf (P) 
OxX:, Oxi, wee OX; <b, 


where 1<i;<in< .-. Ci,z<d; m=1, 2,..., 4. 

Theorem. There exists a constant B (which depends on r,, 
Ty} +++, fq) Which is such that for any /(P) belonging to the class 
w, (L) the inequality 


In? 
N 


(2.16) 


N 
frmaw—zDsP) <LB 
K i=l 


will hold for all integral N, 

So far, it is not clear what is the best method of choosing 
Tis Ta, «++, ‘ge Known estimates of B merely indicate that small 
constants are the most convenient, 

For sufficiently large N, 


ri 
B= ea “nr; t 
may be used in (2.16), 
mAmeng the derivatives covered by this definition the highest order derivative is 


Of 
Ox, 0X2... Og" 
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g(x) 


Fig, 12. 
Example, Evaluate the integral 
i a} 
J= ff ff» aaxayaz, 
000 
where f = g(x) g(y)g(z) and the function g(x)(Fig. 12) is defined by 
4x, 0<xed4i 
1 a 
2— 4x, Tt <*<53 
4x2, x di 
4—4x, 2Exd1. 
The exact value of the integral is 
1 3 
J= [few ax | = 0.59= 0.125. 
Let ry, Pa %a, be equal | to 2,3, 5 and let us estimate / from the eight 


points Pj, P;,..., Ps. Using the values of p,(é) given in paragraph 
21 we obtain the values of g(p,() which are given in Table 9, 


Table 9 
i 1 2 3 4 5 66 7 8 


g(r) O 1 1 % Y/, 1%, iy Y, 
& (Ps (i)) %, Ys ¥, % ¥, 9 % % 4, 
gal) % 4% % ¥, 5 Vos wy, 5 Vag 14, 5 
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The approximate value of J is 


8 
YE ( Po) g (Ps) g (p5()) = 0-125 + 1.120 = 0.140. 


t=1 
23, Evaluation of Infinitely Dimensional Integrals 


In this paragraph K will denote the infinitely dimensional unit 
cube 


K=(0<4,<1, 0%) <1, ..., OS 2, <1, «oe 


A point P of this cube is defined by the denumerable sequence of 
coordinates 


P=(z; Xa» seer Xs, ry 


and dP =dx,dx,...dx,... is an infinite product, Let us enumerate 
all the prime numbers; r,<r,< ...<7rj<... (ie., = 2, ~=3, 
rg= 5, r1y=T, ...)- 

The generalized Halton’s sequence is defined as the sequence 
of points Pi, P;,..., P;,... of the cube K whose coordinates are 
defined by 


Pi = (Pp, Os Pry De +002 Br, (De o0+)s 


This sequence may be used to evaluate certain infinitely 
dimensional integrals. Without giving a precise definition of the 
class of functions, we shall merely note that the piecewise con- 
tinuity and boundedness of all the partial derivatives containing 
not more than one differentiation with respect to each coordinate 
are not sufficient.* A further necessary condition is that the 
dependence of f(P) on x, must decrease sufficiently rapidly with 
increasing s, This was first pointed out by N, N, Chentsov. 

For such functions it may be shown that 


| [1-230 =0(-jnr): 


icl 
where « > 0 is a constant which is as small as desired, 


*In this requirement are included the derivatives of any order, 


5. APPLICATION OF QUASI—RANDOM POINTS 83 
24, The Points P; as Deterministic Quasi-Random Points 


In computational Monte Carlo practice the realization of the 
values of any random quantities is usually carried out by trans- 
forming the values of a random quantity 1 which is uniformly 
distributed in the interval (0,1) (cf. Chapter IV), Thus, instead 
of calculating f (&, &,. .... &), where (6, &, ..., 4) isa random point 
having a given distribution law, one evaluates some other function 


Gp bos eee my) =F (7, 1), ene tT) 


in which 7, 72, ... are similar random quantities which are uni- 
formly distributed in the interval (0, 1). Generally speaking,n>d. 

Thus, the calculation of the average value of / ina given 
d-dimensional region is replaced by the calculation of the average 
value of the function F over ann-dimensional unit cube. The point 


T= (y, 1), es alece ) 


is a random point which is uniformly distributed in this cube, 

According to paragraph 22, the average value ofthe function F 
over the n-dimensional unit cube may be calculatedover the points 
of the Halton sequence, Hence, in some calculations the sequence 
of random points l,, T,, ..., which are uniformly distributed in 
the unit cube, may be replaced by Pj, P3,.... If the function F 
obtained as a result of this procedure turns out to be sufficiently 
smooth, then the method may ensure a more rapid convergence, 

n 
namely, a instead of N ?, 

The solution of a number of physical problems by the Monte 
Carlo method involves the simulation of elementary particle 
trajectories (cf. Chapter III), Theoretically, each of these tra- 
jectories is defined by a denumerable sequence ofvalues 


4), A, wees Tr vee 


and may be looked upon as a point of an infinitely dimensional 
cube, although in practice the calculation of the trajectory is cut 
off, The calculated quantities are functions of the trajectory; they 
are calculated for each trajectory and are then averaged over all 
trajectories, 

It is clear that instead of the random, infinitely dimensional 


points (7, 7°, ..., 76...) it is occasionally possible to use the 
points of the generalized Halton sequence, This means that the 
numbers Pr, Ds Pry Ds ves p, (i), +». may be used instead of the quasi- 
random numbers 7, Y.-..+» Ys +--+» to construct the ith trajectory. 


It follows from paragraph 23 that insome problems the number of 
trajectories required will be reduced when this method is employed, 


Chapter Ill 


Applications of the Monte Carlo Method 
in Neutron Physics 


1. THE MONTE CARLO METHOD IN ELEMENTARY 
PARTICLE PROBLEMS 


1. Introduction 


The result of interaction between a large number of particles 
must be taken into account in many physical problems. The laws 
of elementary interactions (microscopic laws) are known from 
experiment or can be predicted theoretically. However, the 
macroscopic characteristics of matter (for example, the density) 
must be known for practical purposes. 

The classical method of solution of such problems is based on 
equations which are satisfied by the macroscopic characteristics. 
Diffusion equations are an example of this type of approach, and 
well developed numerical methods are available for their solution, 
In other problems it is necessary tousethe equations of transport, 
or the kinetic equations, for which numerical methods of solution 
have been developed only insimple cases. Finally, there are many 
problems for which macroscopic equations are not available at all. 

The Monte Carlo method may be used for the approximate 
calculation of the required characteristics without recourse to 
macroscopic equations. 

The method of macroscopic solutions will, in general, yield a 
considerable amount of information, for example, the asymptotic 
behavior, approximate relationships, and so on, The Monte Carlo 
method is a numerical one and should be compared with the 
numerical methods for the solution of macroscopic equations 
rather than with the method of macroscopic equations itself. In 
many problems, and particularly in complicated problems, the 
Monte Carlo method has many advantages as compared with the 
classical numerical methods, apart from the fact that in many 
cases the macroscopic equations are not known, 
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2. Simulation of a Physical Process 


Suppose a beam of particles enters a region G of a medium. 
Consider one of these particles and suppose that the distribution 
of free paths is known, so that it is possible to choose a free path 
and find the point of collision of the particle with an atom in the 
region G. 

If the region G contains a mixture of media, then it is possible 
to determine with which particular kind of atom the particle 
interacts, because the probabilities of collision are proportional 
to the amounts of the various atoms present. 

The incident particle may cease to exist, i.e., it may become 
absorbed, or may be scattered, i.e., it may assume a different 
energy and direction of motion, If the particleis a neutron, then a 
collision with a fissile nucleus may give rise to a disintegration 
of the nucleus and to the appearance ofa number of new neutrons. 
If the particle is a photon, it may give rise to the appearance of 
an electron-positron pair, or to the appearance of a free electron 
(photoelectric effect), and so on. The probabilities of the various 
interactions between incident particles of a particular kind and the 
target atoms are known: they are characterized by the so-called 
interaction cross sections, or partial cross sections. It follows 
that the type of interaction occurring in a given collision can also 
be determined. 

The history of each new particle can be followed in a similar 
way. As a result, a branched trajectory is produced in the region 
G whichis sometimes calledacascade or shower. Figure 13 shows 
this type of effect, andis, for the sake of simplicity, confined to 
two dimensions. It is, in a sense, the genealogical tree (p-photon, 
e~-electron, e+-positron), In general, such showers will be in- 
finite, although in practice a finite number of branches is taken. 

The point at which the shower must be cut off is usually clear 
from the conditions specifying the particular problem. 
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Consider now a number of incident particles large enough to 
give a good representation of the part of the incident beam which 
enters the region G, and let us construct for each of the particles 
a diagram of the form shown in Fig. 13. All the characteristics 
which are of interest may be approximately determined from this 
set of diagrams, for example, the number of particles of a given 
type at different instants of time, the number of interactions of a 
particular type, the energy and angular distributions of the 
particles, the amount of liberated energy, and so on. 

This is the basic scheme for the application of the Monte Carlo 
method in elementary particle physics. A more detailed analysis 
would require the inclusion of elementary (microscopic) laws which 
are characteristic of the particular type of problem. 

In the present chapter we shall consider some problems in 
neutron physics, which is the main field of application of the Monte 
Carlo method in physics. However, the methods which will be 
described in Section 3 can be used, for example, in the study of 
the propagation of photons, the only difference being the specific 
elementary interactions and the laws describing them. 


3. Organization of the Calculations 


One of the methods isto recordallthe showers and then analyze 
them, i.e., select the required data and carry out the necessary 
calculations. This involves recording the coordinates and times 
of all collisions, types of particles, their origin, velocities, 
energies, and so on. The advantage of this approach is that it will 
conserve all the information implicit in the data. The disad- 
vantage of the method lies in the fact that it is necessary to store 
a large amount of data (problems in which the number of showers 
is less than 50~100 are not frequently encountered). 

Since modern electronic computers have a relatively small 
internal (fast) memory, the above method is not as a rule con- 
venient for them. In actual fact each shower is constructed 
independently of all others, The analysis is carried out for each 
shower in turn so that the results of the analysis gradually 
accumulate. In this method the number of operations involving 
the external (slow) memory is considerably reduced. Moreover, 
it is not as a rule always necessary to construct the shower first 
and then analyze it. It is possible to carry out the analysis as the 
shower grows, so that the internal memory need not store the 
entire shower. The following are two methods for such gradual 
analysis of a shower. 

A. Genealogical analysis. This method is convenient when the 
branches are long but the degree of branching is not too high. 
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Particles of a given generation are used to determine the next 
generation and the constructed part of the shower is analyzed. 
This means that not more than two generations need be stored in 
the memory. 

It should be noted that the term generation may refer to any 
particles which are not each other’s ‘‘ancestors.’’ For example, 
the particles joined by the dashed line a in Fig. 14 may be looked 
upon as belonging to a given generation. The next generation is 
then unambiguously determined and is indicated by 6. 

B. Lexicographic analysis of a shower. This method is con- 
venient when the branches are not too long but the degree of 
branching is large. In this method a particular branch is followed 
right to the end and all the side branches are noted. Having 
reached the end of the particular branch the path is retraced to 
the preceding site and the final section is ‘‘cancelled.’’ The 
procedure is then repeated, starting from any of the recorded 
sites. If, at a particular point, there are no recorded branches, 
then the path is retraced through one further section and the latter 
is cancelled. The count is completed when the entire shower is 
cancelled. 


The numbers in Fig. 14 indicate the order of the lexicographic 
tracing of a shower. Itisclear thatat any given time the computer 
memory will store only one complete or incomplete branch of the 
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shower, and all the offshoots along it. Figure 15 shows the part 
of the shower which is stored in the memory when site 11 is 
reached. A program which may be used with the lexicographic 
method has been given by Golenko [25]. 


Fig. 15. 
4, Estimated Accuracy 


In ordinary computational practice the error in a complicated 
calculation can hardly ever be calculated from a theoretical 
formula. The accuracy must be estimated from various additional 
sources of information, for example, comparison of results ob- 
tained with different grids and methods, analysis of special 
problems for which solutions are known, and so on. Theoretical 
estimates of the errors {including estimates of convergence) 
usually play the part of additional sources of information. 

A rigorous estimate of the accuracy is impossible in the case of 
the Monte Carlo method. However, the various additional sources 
of information mentioned above do remain significant. In this 
sense, the Monte Carlo method does not differ so very much from 
the classical computational methods for which a precise deter- 
mination of the errors is possible in principle. 

The Monte Carlo method is frequently foundto have advantages 
even in comparison with classical methods, since even in very 
complicated calculations it is often possible to obtaina statistical 
estimate of the accuracy. In fact, most of the results obtained by 
the Monte Carlo method arethe mathematical expectations of some 
random quantities § which are functions of the shower. The prob- 
able relative error is then given by 


3. pe 087 DE 
prob" "we VN ' 


where Dé is the variance and Nisthe number of trials (showers). 
This quantity is a reasonable representation of the order of the 
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statistical error (cf. Chapter II). In order to estimate Dé, both 
dé and Se must be determined. 
@) 7) 

When the quantity under investigation is morecomplicated (for 
example, kes in Section 4) some information about the error may 
be obtained by examining the variation in the result with increas- 
ing N. For example, Bouquet et al. [84] assumed that the deviation 
from the exact value follows a normal distribution when N> Np. 
However, this type of approach has not been adequately justified. 


5. Introduction of Statistical Weights 


Statistical weights are frequently used in the determination of 
particle trajectories by the Monte Carlo method (they are some- 
times referred to as fictitious masses), The rigorous mathe- 
matical introduction of statistical weights (see the following 
paragraph) is not particularly instructive. Inpractice, the weights 
are usually introduced on the basis of elementary physical ideas 
which may be illustrated by an example of weights describing the 
escape of particles from a finite region. 

Consider the trajectory of a particle ina finite convex region G. 
The free path / of the particle will be regarded as a random 
quantity with a probability density p(x) which is proportional to 
exp(—x/a), where 0 < x < co. Let x* bethe distance from the point 
of creation of the particle to the boundary of the region G along 
the direction of motion. 

In the usual method of constructing a trajectory the length / is 
determined with the aid of the formula 


t 
f p(x)dx = 7, 
0 


where 7 represents a value of a random quantity which is uniformly 
distributed in the interval (0, 1). When/ > x* the particle is con- 
sidered to have left G. It is possible however to calculate the 
probability of escape of a particle from G from the formula 


oc 
j p(x) dx = e-*"a, 
x 


and then proceed as follows. Let the particle be replaced by a 
bunch consisting of a large number w of particles, so that the 
number of particles escaping from G is w exp(—x*/a) while the 
number of particles remaining inside Gis w’ = ah —exp (—x*/a)]. 
The free path 7 for the remainder of the packet inside G may be 
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determined by assuming that the density remains proportional 
to exp(—x/a) as before, but withO <x <-.x*. The free path / can 
then be determined from 


T x 
f p@dx=1 f pax. 
0 Q 


This approach is usually employed in practice. The number of 
particles w is in fact the statistical weight. 

It is clear from the above discussion that instead of a single 
particle with a statistical weight w it is possible (beginning with 
any point on the trajectory) to take m similar particles with 
weights w/m and determine the history of each of them by the 
above method. In other words, it is always possible to branch out 
the trajectory and increase the number of particles. 

The converse procedure is also possible. It involves the con- 
sideration of a fictitious collision in which a particle ‘‘disappears’”’ 


with a probability 1 — + or assumes a weight mw with a proba- 
: 1 

bility =. 

particle becomes so small that it cannot conveniently be followed. 


Another example of the introduction of weights with the aid of 
the above methods is given in paragraph 14. 


This device is used when the statistical weight of the 


6. Mathematical Theory of Statistical Weights 


The mathematical basis for the introduction of statistical 
weights is the method of selective sampling (cf. Chapter I). In 
fact, suppose that we are concerned with the random quantity = 
which has a density p(x), a<x <6. Itis requiredto determine the 
average value of a function f(t) in the interval {a, 6]: 


6 
Mf @= f f() p(xae. 


Instead of § one can take some other random quantity » which has 
a positive density p(x), a<x<. This willinvolvethe calculation 
of the average value of another function, i.e., 


Fma=srmMe@, 
Py) 
since 


a) 5 
Mi (i) = f 7) pax = f f() p~)ax =MS 
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At the same time the variances of thetwo quantities are different, 
since 


6 
MPG) = f/x) p(x) dx, 


while 


& 
79 (on) — , P(x) ] 7 
mPco= f P(E] eas 
If p(x)/p(x) <1 in the part of the interval (a, 5) which contributes 
most to M/?(:), then itis to be expected that the replacement of £ by 
7 will lead to a reduction in the variance and possibly to an 
increase in the accuracy. 

In simulating the particle trajectory, it is necessary to deter- 
mine the successive random quantities ¢,, ¢,... with densities 
Pi (x), po(x),---,* and the resulting trajectories are used to esti- 
mate the average value of a function/ (&,, &, ..., &,,)- At each stage of 
construction of the trajectory the determination of §, may be re- 
placed by the determination of any other random quantity 7, with 
positive density p,(x). This will yield other trajectories and the 
function to be averaged will have to be replaced by 


(7) Po (ne) Pm (tm) 
S (nye Tye ov oe QS Be), Lott) 
iti yD Cy) Pa (na) Pm(\m) 


Thus, the ‘‘weight’’ of the particle varies in accordance with the 
law 


W,p= Wy 4 Pa Ce) =, 
Pp (1) 
and the averaged function /(7,, %. ..., 7,,)mustbe givena ‘‘weight’’ 


Wm: 

The above analysis leads to the following practical recom- 
mendation: if the weight decreases asthe particle moves along the 
trajectory, then the method of weighting is suitable. 


*Strictly speaking these are the conditional probabilities 
Py (*) = Dg (X18 oer &,_ 4) 


However, we shall not dwellon the more complicated concepts of the theory of proba- 
bility (Markov chains). 
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In the example discussed in the preceding paragraph 
(ate, p(y=—e 
P ~ @ ee ~~ a(1—e7**a) * 


and hence, 


PY 3 g-tag 1. 


It should, however, be noted that in this example the intervals in 
which p(x) and p(x) are defined are different. The analysis given 
in the present section will apply, however, because the averaged 
functions vanish for x > x*, i.e., outside the region G. 


2. SIMPLE INTERACTIONS OF NEUTRONS 
WITH NUCLEI AND THEIR SIMULATION 


7. Effective Neutron Cross Sections 


In neutron physics the interaction of nuclei with neutrons is 
usually described by the so-called effective cross sections. 

Suppose that a homogeneous beam of neutrons is incident 
normally on a monatomic layer of unit area, and let n be the 
number of atoms in the layer per unit area. If the fraction of 
neutrons taking part in an interaction is d then the effective cross 
section « of the nucleus for the particular interaction is given by 
= £ 

The effective cross sections of nuclei (sometimes referred to 
as the microscopic cross sections) for different interactions with 
neutrons are available in the literature[2]. Asa rule, these cross 
sections depend on the neutron energy. The most frequently 
encountered cross sections are the following: 
o,(E) — scattering cross section, c,=a,,+9,, , 
o,.(E) — elastic scattering cross section, 
c,,(E) — inelastic scattering cross section, 
o,(E) — capture cross section, 
o,(E) — fission cross section, 
o,(E) — absorption cross section, o,=«,-+6,, 
o,(£) — total cross section, o,=c,+-0,-+-9,. 

The macroscopic cross sections are defined as the products 


<=, 


where p is the nuclear density, i.e., the number of nuclei per unit 
volume. « is usually measured inbarns where 1 barn = 10°“ em?. 
It follows that the dimensions of 2 are cm™', 
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An example of the calculation of 2, is giveninparagraph12, In 
the case of a mixture of m different media 


P= SFQy t+ Lay ++ + 2m 


The ratios of the various cross sections to the total cross 
section characterize the probabilities of the various interactions 
which occur when a neutron collides with the nucleus. They are 
used in the determination of the history of the neutron. For 
example, when the neutron collides with the nucleus for which 


qr, = r+ r+ Xp 
the probabilities of scattering, capture and fission are respec- 
tively equal to 


BEY 
a Te Tar 
i.e., the lengths of the intervals shown in Fig. 16. The order of 
these intervals is arbitrary, but should be fixed before the Monte 
Carlo method is applied. In order to determine the type of inter- 
action, one finds the value of a random number 7 and determines 
into which of the three intervals it falls. If 


c Es. 
1< ye 
then scattering occurs. Capture corresponds to 
z 24-5 
oan 
while fission corresponds to 
pee a 
Ay y < 
fe) Is Letle 1 
Xt rt 
Fig. 16. 


8. Elastic Scattering 


Consider now the elastic scattering of a neutron by a nucleus 
of mass number A. This event may be defined by two random 
quantities which can conveniently be taken as the scattering angle 
@ in the system referred to the neutron-nucleus center of mass 
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(Fig. 17) and the azimuthal scattering angle %, which are such 
that 


O<bcn, OSy<2n. 


Fig. 17. Scattering of a neutron in the center 
of mass system. 


The angle « through which the neutron is scattered (Figs. 18 
and 19) and its final energy E’ can easily be calculated from the 
laws of conservation of energy and momentum. They are given by 


Acos§+1 
cos vb = ————____——_-; 3.1 
¥ V A? + 2A cost +1 ( ) 
EB’ A?4 2Acos§+1 
eG *< (3.2) 


The scattering is usually assumed to be isotropic in the center 
of mass system, This means that cos § is uniformly distributed 
in the interval (—1, +1) and the angle’z% is distributed uniformly 
in the interval (0, 2n). However, in the laboratory system of 
coordinates (x, y, z) illustrated in Fig. 19, the various directions 
of scattering are not equally probable. It is clear from (3.2) that 
the energy £’ is uniformly distributed in the interval 


(45 


The rule for the determination of elastic scattering which is 
isotropic in the center of mass system consists of the following. 


\ BS eE. 


4 


a 


A 


vs 
5 a oe pee 
“a, 


~~“ 


Fig. 18. Scattering of a neutron in the 
laboratory system. 
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Fig. 19. 


Two random quantities y, and 7, are found and it is assumed 
that 


cos§=2y,—1, y= 2r7y. 


Next, cos » and E’ are determined from (3.1) and (3.2).* 
Equations (3.1) and (3.2) remain in force in the case of aniso- 
tropic scattering. The azimuthal angle xy is again taken to be 
uniformly distributed in the interval (0, 2x), However, the density 
of the cos § distribution turns out to be proportional to the 
differential cross section for elastic scattering in the center of 
mass system o,(0). The formula for the determination of the 
quantity p= cos @is 
ad o 
Qn f 26(8) dp = 7056. (3.3) 
-1 


When o,(6)= const, equation (3.3) becomes identical with that 
given above for isotropic scattering. In fact, it follows from the 


*Since we require not the angle y but only cos yx, it is usual to use Neumann's 
assumption, which enables us to find cos 4 immediately (cf, paragraph 10, formula 
for cos? ). 
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normalizing condition for the probability densities that , (0) =<“ . 


Substituting this into (3.3) we have p = 27—1. 

It should be noted that the differential cross section is more 
correctly denoted by o,(p) since the normalization is carried out 
with respect to». The differential cross section for elastic scat- 
tering in the laboratory system o,(p) is also used. The relation 
between a, (8) and o,()can easily be determined with the aid of (3.1) 
and is 


0, (0) dcos 9 =a, (p) dcos yp. 


In elastic scattering on heavy nuclei A>>1. It follows from 
(3.1) and (3.2) that 


E’=E, cosbmcos, 


Experimental data show that the majority of heavy nuclei give 
rise to anisotropic neutron scattering, so that the corresponding 
differential cross sections o,(/) are not constant. Itis these cross 
sections which must be used in determining the directions, together 
with FE’ =E. 

However, in many calculations the anisotropy of scattering on 
heavy nuclei is neglected. The scattering process is then taken 
to be isotropic in the laboratory system and the equations of 
paragraph 10 may be used to determine the direction. 


9. Inelastic Scattering 


The laws governing inelastic scattering are very complicated. 
It is usually assumed that inelastic scattering is isotropic in the 
laboratory system. It follows that the formulas of paragraph 10 
may be used to determine the directions. 

The energy distribution of inelastically scattered neutrons is 
usually described by empirical or semiempirical curves. The 
energy of an inelastically scattered neutron must also be deter- 
mined probabilistically. 

In the case of inelastic scattering on heavy nuclei it is often 
assumed that the energy distribution of the scattered neutrons is 
Maxwellian, i.e., the distribution density is proportional to 


E’ exp(—E’/T) whereT= aE", a isaconstant which depends on the 
particular medium, and 0 <E£’<£. The formula for the energy of 
the scattered neutrons E’ may be reduced to the form 


qa 


reer 4 8)=rfieer( +9) 
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and hence in each determination of the value of E’ it is necessary 
to solve the transcendental equation 
e*(1-+y)=e, 


where y=E’/T and e=1—y [1 — (1+ 82/7) exp(—£/T)|. When 
0<e< 1 the latter equation has a single positive root which may 
be computed by an iteration method: 


yO >0- is arbitrary; 
yist) == In(1 -+ ys!) — Ine. 


10. Formulas for the Determination of the Direction 
of Scattering 


Let w be a unit vector in the direction of the velocity of the 
neutron before the scattering event. In Cartesian coordinates 
w=wi+o,j+o,k, wf+ 0j-+ of = 1, 
while in spherical coordinates 
o,=sinOcosy, w,=sinbsing, w,—=cosé, 


Let w’ be a unit vector in the direction of the velocity of the 
neutron after the scattering event, and letthe corresponding com- 
ponents be wo’, 5, w, and the corresponding angles 9’ and ¢’. 

The relations connecting the two sets of angles are 


cos #’ = cos fcosp-— sin 8 sin cosy, 


im fol __ siny sin x 
ly —)— ae 


cos b — cos f cos 6” 


sin § sin b% 


cos(¢’ — 9) = 
The relations between the two sets of Cartesian components are 


w, =w,cosp+-sindcosy V 1 —w?, 


| +, [», (cos y sas 45) “_ sin b sin X®, V 1— 2], 
3 


wf == 
2 = 


o = —, [», (cos — 3) — sin y sin xo, V1i— w?], 
W3 

In these formulas y is the azimuthal scattering angle and 4 is the 
angle between the unit vectors w’ and w (cf. Fig. 19 where P is the 
plane of scattering containing the vectors w’ andwo, the plane Q is 
perpendicular to w, the plane XR contains the Oz axis and w, and the 
angle % is measured from the line of intersection of the planes Q 
and R). 
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When the scattering is isotropic in the laboratory system, the 
new direction w’ is independent ofw. Itis convenient to determine 
it to begin with. This can be done as follows: 

1) Take a random number 7 and assume that 


cos 0’ = 27— 1. 
2) Take two random numbers 7, and 7,. When 
(27,— 17+ 4>1, 
the particular pair selected is rejected and another is taken. When 
(27,— 1? + <1. 
then 
(24, —1e—% sp 2 (24, —1) 12 
“ia! Se = Lee 
(24-1 +2 Cau—ly +t 
In Cartesian coordinates 


cos og’ = 


o = V i —cos?6’ cos 9’, 
| 
’ ahs 
wo, = V 1 —cos? 6’ sing’, 
w, = cos 0’. 


11. Fission 


The number of neutrons v which are formed during the fission 
process is a random quantity. As a rule only its average y is 
known. All-the velocity directions are equally probable for each of 
the fission neutrons. The neutron energy isalsoa random quantity 
and for simplicity it is usually assumed that the spectrum of 
fission neutrons is independent of the energy of the incident 
neutron. 

For example, in the case of U2 the average number of neutrons 
per fission is about 2.5, while the neutron energy distribution 
density is approximately given by 


n(Ey=V a sinh / OE e-£ 


where the energy is in mev. 

To describe the fission process by the Monte Carlo method, 
suppose that m<v < m-+1 where m is an integer. In order to 
construct the shower, it is frequently assumed that v takes on two 
values v,==m andv,=m-+1 with probabilities p,—m+1—v and 


Po=v—m. It follows that it is necessary to choose a random 
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number 7 and if 7< p, then it is assumed that v=», and if p, < 7 

then v=». Next, for each of the fission neutrons a Monte Carlo. 
determination is made of the directions and the energies. The 
formulas for equally probable directions are given in paragraph 
10. The formula for the determination of the energy E’ is of the 
form 


E Emax 
f =@ae=1 f «ae. 
Emin Emin 


In many cases in which statistical weights are employed the 
random quantity » is not determined by the Monte Carlo method. 
Instead, the statistical weight of the incident neutronis multiplied 
by . and the direction and energy are determined by the Monte 
Carlo method for this single neutron. In this way, branching 


trajectories can be avoided, 
12. Length of the Free Path 


The free path / of a neutron is a random quantity. The law of 
distribution of the free paths is of the form 


x 
- | 3,ds 
Pii<x}=1l—e i” ‘ 


where s is the distance from the preceding collision along the 
direction of motion of the neutron, The distribution density is 


x 
- [% 4s 


Py (x)=Xe ° . 


In a homogeneous medium in which 2, is independent of s, the mean 
free path is given by /=1/%,. 

Example. Calculate the neutron mean free pathinberyllium. It 
is known [2] that o,=6 barn (at 300°K and energies between 0.1 and 
10,000 ev), In order to determine the number of Be atoms per 
cubic centimeter its density (1.84g/em’) must be multiplied by 
Avogadro’s number (6.02 x 10%) and the result must be divided by 
the mass number (A=9), The final result isp—1.23 x10% cm-$ 
and hence L,= ps, 0.74 cm~ and /=1.3 cm. 

In a homogeneous medium, the free path may be determined by 
the Monte Carlo method with the aid of the formula 


1 
came c (3.4) 
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where 7 is a random number. If the mean free path / is taken as 
the unit of length, then /=—1In ;. If the homogeneous medium is 
finite and the escape of neutrons is allowedfor by the appropriate 
statistical weight (paragraph 5) then Equation (3.4) must be re- 
placed by 


t=— s-nfi—7(1—e™)], (3.4') 


where s* is the distance to the boundary of the region along the 
direction of flight of the neutron. The new statistical weight of 
the neutron is 


w’ = w(l —e*"*). 


The free path of a neutron canbe defined for any type of inter- 
action, for example, scattering. All the preceding formulas will 
then hold except that 2, must be replaced by the appropriate 
interaction cross section (, in the case of scattering). 


13.. Simulation of the Free Path in a Nonhomogeneous Medium 


Consider a region consisting ofa finite number of homogeneous 
regions. In fact, any nonhomogeneous region may be divided into 
small regions within which the composition of the medium remains 
constant. 

The boundaries of the regions are, in practice, plane, spherical 
or cylindrical surfaces, The intersections of any ray with such 
surfaces may be found with the aid of the formulas of analytical 
geometry. The method of calculation given below may be used to 
simulate the free path of a neutron for any geometry. 

In order to obtain an insight into the problem, the calculations 
are illustrated by the example shown in Fig. 20 in which a 
cylindrical region I (active zone of a reactor) is surrounded by a 
cylindrical layer II (reflector) and two cylinders II and IV (end 
pieces), 

Suppose that a neutron at the point r, escapes in the direction 
of the unit vector w’,. The equation of the trajectory (ray) is 


nw=rt+sw’, s>0. 


Rules for the determination of the free path by the Monte Carlo 
method: 

1) Find the distance (along the ray) toallthe bounding surfaces 
for the particular geometry. Inour present example this will yield 
the following seven numbers: 
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Fig. 20. 


8, <8, — distances to the inner cylindrical surface. 
s, — distance to the outer cylindrical surface. 
s, — distance to the planez=H,. 
s; — distance to the planez= H,. 
s, — distance to the plane z=— H,. 
s,; — distance to the plane z=— H,. 


2) Find the distance s* (along the ray) to the outer boundary 
which is the smallest positive number among all the distances to 
the surfaces making up the outer boundary. In our example s* is 
the smallest positive number among the numbers s,, s, and s,. 

3) Arrange all the positive values s,, whichareless than s* in 
ascending order: 


O = 809) < Sry < Say << oes KS 

and calculate the lengths of the following sections along the ray: 
Lj) = $7) — S(j-1)s i=], 7 ee 

Each of these sections lies wholly inside one of the regions. 


4) Find the values ¥,=¥2,,, which correspond to /,,,. To do 
this, it is sufficient to determine the region to which the point 
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Fj) = Pot 0.5 (8, 9-1) + (5) w’ 


corresponds.* 
5) Select a value of a random number 7 and find the number i 
for which 


Loylay tt ee +2 aeataey < I 7 < Bayly be HZ aya 
The free path is then given by 


—Iny—(Caylay + oo. #2 Gey lu-y) 
bly eee lg $—— > a oe 


If there is no value / for which this condition is satisfied, i.e., if 


Zalayt --+ tLe <<—Iny, 


then the neutron is considered to have escaped from the region. 
Having determined the length /, the new position of the neutron 
is found from 


ry ==ry tlw’. 
In our example, for the ray shown in Fig, 20 s*=s,. Next, 


ye ue __ ot 
Say = Sp Say = Sqr $(3) = So. Suy=S5 ‘ 


The section /,,, belongs to II, /,., belongs to I, /,, belongs to Il and 
4(4yalso belongs to the region III. 

The number of calculations involved in the above scheme may 
frequently be reduced by allowing for the specific properties of the 
particular geometry. Thus, if s;>0 in the example considered 
above, then s, need not be calculated at all, and so on. However, 
this leads to a complication of the logical scheme of the calcula- 
tions and therefore the programming is also more complicated 
(except for a very simple geometry). 

Consider now the determination of the free path of a neutron 
when the possibility of escape is allowedfor by means of statistical 
weights (paragraph 5). In this case the scheme will also consist 
of rules 1, 2, 3, and 4 and thereafter the procedure is as follows: 

5) Select ene of the successive values of a random quantity 

and calculate the quantity 


P= 1—y[1 —e Cat yy], 


*It is clear from this that the programs should contain a part which determines the 
range to which any selected point r belongs. 
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Find the number i which is such that 
Satay + FB y-yla-y < lal < Bayly ee +2 aya 
The free path is then given by 


—InT— (Cala +... +2 a-yla-n) 
Sy ha 


The new statistical weight of the neutron is given by 
= w[1— eC! tty), 


14, Statistical Weights Replacing the Monte Carlo 
Determination of Interactions 


We shall consider the two most frequently encountered methods 
of introducing these weights. For simplicity, we shallassume that 
there is only one type of scattering (elastic or inelastic). 

Consider a bunch consisting of a large number w of neutrons. 
During a collision w(Z,/5, neutrons will be absorbed, w (E/E) 
neutrons will be scattered and w(Z,/Z,) neutrons will give rise to 
fission as a result of which vw(Z,/2,) fission neutrons will be pro- 
duced. It may be supposed that after a collision there will be two 
neutrons, namely, a scattered neutron with a weight 

2; 


w’ aude (3.5) 


and a fission neutron with a weight 


ww’ =w =a (3.5') 

Consider now the neutron bunch again, but assume that during 
the entire free path neutrons are continuously removed from the 
bunch as a result of absorption. Since the probability that a neutron 
willbe absorbed intheinterval(x, x-+-dx)is equaltoZ,exp(—2,x)dx, 
it follows that the total number of neutrons which will be absorbed 
from the bunch in a free path will be w(1 — exp(—2,/)). 

As before, a fraction of the neutrons will be scattered while a 
further fraction will give rise to fissions. It may be considered 
that two neutrons will be produced, one of which will be the 
scattered neutron with a weight 


mee, | = 
w’ = we “ec +e (3.6) 
while the other will be the fission neutron with a weight 
sy Vy ‘ 
w'=we « i +5;" (3.6) 
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In the second method of calculation, the path length is deter- 
mined from 2,-+-2, and not from &,. 

In both cases considered above, the type of interaction occurring 
during the collision is not determined, andthetrajectory branches 
out into two each time (if it were desired to allow for both elastic 
and inelastic scattering, then the trajectory would have to be 
branched into three each time). 

When the medium does not contain fissile material (£,—0), 
formulas (3.5) and (3.6) become very simple. Itis then unnecessary 
to carry out a Monte Carlo calculation ofthe absorption and there 
is no branching. 


3. PASSAGE OF NEUTRONS THROUGH A PLATE 


The present section will be concerned with the passage of a 
uniform beam of neutrons through a plane~parallel plate. It will 
be assumed that the plate is homogeneous and contains no fissile 
materials so that the total cross section is given byt,—2,+-%,. 
This type of problem is often encountered in practice, for example, 
in the design of reactor shielding, and a large number of papers 
have been devoted to it. In different problems it is necessary to 
determine the number of transmitted neutrons, their energy dis- 
tribution, the reflecting power of the plate (albedo), the energy 
distribution of the absorbed radiation, and so on. 


15, Simulation of Physical Trajectories 


Let the Oz axis be perpendicular to the plane of the plate, and 
consider a point z suchthat 0<z<h, with the neutrons incident onto 
the z=0 plane. The calculation is particularly simple because of 
the one-dimensional geometry. The state of a neutron is char- 
acterized by only three quantities, namely, the z coordinate, the 
energy E and the direction of flight defined by »—cos 9 (Fig. 21). 
The initial values for the trajectories are selected as follows: The 
initial value of z is always taken to be zero, while the initial 
direction and energy depend on the properties of the incident beam, 

Consider some special cases. 

a) Monoenergetic beam; here the initial energy EF, is given. 

b) The energy spectrum n(£)ofthe incident beamis given. The 
initial energy E, is then determined by the Monte Carlo method, 
using the formula 


Ey Emax 
fn@ae= 1 A) n(E) dE. 


Emin Enin 
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Fig. 21. 


c) A directed beam; here the value p», is given. 

d) Spatially isotropic beam. The value of p, is determined 
statistically from the formula p»,— 7. 

Subsequent calculation of the trajectory is carried out as 
follows. 

1) The coordinate of the nth collision (n=1, 2, ...) is deter- 
mined statistically from 
Pn-iIny 
Ly (En~1) * 

2) The following conditions are then verified: (a) Ifz,—’>0, 
the neutron has passed through the plate, and (b) if z, <0, the neu- 
tron has been back-scattered. The trajectory terminates (and a 
count is recorded in the corresponding counter) when either of 
these two conditions is satisfied. 

3) If 0<z,<A then the fate of the neutron after the collision is 
determined statistically by taking successive values of a random 


quantity 7. If 7 << then the neutron is considered to have been 


absorbed and its trajectory terminates (this is recorded in the 
corresponding counter), 


4) If = <7 then the neutron undergoes a scattering event and 


2n = 2n-1— 


its direction and energy must be determined statistically. 
In particular, if the scattering is elastic and isotropic in the 
center of mass system then 


cos6=2y—1, 7=2ry,, 
1+ Acos& 


cos 8 S= — 
* V1i+2A cos 0-4 A? 
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A2+2AcosO+1 
ri rr 
Pa =Pn-1cosp-+cosz V (1 — p2_,)(1 — cos? 4). 


Thus, a neutron in the state(z,_,, p,_;, E,_ undergoes a transi- 
tion to the state (z,., »,, £,). The calculation of the trajectory is 
continued until one of the three conditions for its termination 
(transmission, reflection or absorption) is satisfied. Next, the 
values of 2), py, &, are again selected (or are statistically deter- 
mined) and are used to calculate the next trajectory. 


16. The Use of Statistical Weights Representing 
Absorption 


When the fraction of transmitted neutrons is small (this is a 
typical case in shielding design) then it is clear that the above 
method will not be very efficient. For example, if the probability 
of transmission is of the order of 10~° then in order to obtain, 
say, 10 transmitted neutrons, it is necessary to construct about 
10° trajectories, and the probable error inthe determination of the 
probability of transmission will be about 20%. In such cases the 
simulation of physical trajectories is inconvenient and statistical 
weights should be introduced instead. 

Each neutron state is then described by four quantities (z,, 
vp £, @,) and the initial values z), »,, £, are chosen as in para- 
graph 15 with a usually assumed to be equal to unity. Subsequent 
calculation of the trajegtory is continued as in paragraph 15, 
except for step (3); the absorption of the neutron is not determined 
statistically and instead its weight is adjusted so that 


Xs (En-1) 


a id a SF (A Bi 


This method of computation allows us to conserve a large 
number of trajectories when the absorption is large. 


17. Calculation of the Probability of Transmission 


Suppose that it is desired to determine the probability p(t) of 
the transmission of neutrons through a parallel plate. In the case 
of physical simulation (paragraph 15) a count isassumed to occur 
in the counter whenever a trajectory ends in transmission 
(z, —h > 0). When, out of a total number N of such ‘‘case histories,’’ 
the number resulting in transmission is N’ then the required 
probability is given by 


pi) (3.7) 
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In the calculation involving the statistical weights (paragraph 16) 


the counter reading is increased by w,, which are the weights of 
the transmitted neutrons. The required probability is then given by 


pi), (3.8) 


where > is evaluated over all the transmitted neutrons and >) over 
all the trajectories; /=1, 2, ..., N arethe serial numbers of the 
trajectories, andz=0, 1, 2,..., 2, arethe serial numbers of the 
collisions inside the plate for the trajectory withnumber /. 

When w, = 1, the denominator in (3.8) is equal to N. 

In order to illustrate why the second methodis as a rule better 
than the first, let us considertwo simple random quantities §, and 
N,- Let Q, be the probability of transmission without collisions 
for a neutron from the vth state, which is assumed to be known, 
This probability can easily be calculated from the formulas given 
in paragraph 12: 


t, (E,) 8 
Q,=} e (CT, , Where yp, > 0; 
0. where p, <0. 


The random quantity corresponding to the first method of cal- 
culating p(A) is 
__f 1 withthe probability (2/2, Q,, 
=| 0 withthe probability 1—(Z/=,)Q,, 
while the random quantity corresponding to the second method is 


7 =e with the probability Q,, 
0 with the probability 1— Q,, 
The mathematical expectations of these quantities are 
a Mie oe 
Mig= Mra= (31) 


In order to compare the variances, it is sufficient to compare the 
mathematical expectations of the squares 


Mis = (-s¢) Qn 
wa—()e 
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It is clear that Mui <M, and hence* 
Du, < DE,. 
It can be seen from the above discussion that it would be still 
more convenient to carry out a direct calculation of the quantity 


(2,/2,)Q, for each collision separately. We are thus led to a third 
method of estimating p(h): 


PO (3.9) 


2 


The latter formula involves a considerable volume of calcula- 
tion for each collision. Nevertheless it does leadto a considerable 
economy as compared with (3.8) if the same accuracy of the final 
result is required. By using (3.9), more information is extracted 
from each collision. 


18. Further Remarks on the Use of Trajectories 


In constructing a trajectory with a statistical weight it is not 
essential to continue the trajectory until the neutron leaves the 
layer. The trajectory may be terminated when the weight w, be- 
comes smaller than some given number e. which is, of course, 
chosen on the basis of physical considerations. Other conditions 
for terminating the trajectories may arise in various specific 
problems. For example, if we are interested in the absorption of 
neutrons within a given spectral band, then the trajectory may be 
terminated as soon as the neutron energy leaves the energy band. 

During the calculation of the trajectory, a considerable amount 
of useful information may be extracted from each collision. For 
example, in order to find the probability g(h) of reflection of neu- 
trons by the plane-parallel plate, one can compute the reflection 
probability 


2 
E\—& 
a= etl n) Yn, where Lp < 0, 


0, where p, >0. 


*The quantities €, (and also 4,) corresponding to different n are not independent, 
It follows that this demonstration of the advantages of this second method as compared 
with the first method is not rigorous. 
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The formula for. qg(#) analogous to (8.9) is 


At the same time, it is possible to calculate the probability of 
transmission (or reflection) for a plate of some other smaller 
thickness h’< h. For thethinner plate the neutron may be regarded 
as transmitted whenz,—h’ >0, Even when subsequent collisions 
result in the return of the neutron into the thinner plate it need 
not be taken into account. For example, the neutron whose tra- 
jectory is shown in Fig. 22 is reflected by the thicker plate after 
n,==8 while for the thinner plate it is regarded as transmitted 
after ny =3, 


It follows that 


where =. is taken over all the escapes from the thinner plate and 
ny is the serial number of the last collision in this plate. It is, of 
course, possible to use a formula of the form of (3.9): 
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where 


for By > 0, 
0 for p,<0, 


All these formulas are, in fact, special cases of (3.10). It is 
true that in paragraph 20 the trajectories are constructed using 
x, and not &, and absorptionis allowedfor by weights of a different 
type (cf, paragraph 14), but this does not alter the basic situation. 


19. The Method of Similar Trajectories 


Morton [210] has suggested another method whereby trajec- 
tories constructed for a homogeneous plate H may be used in 
calculations for a similar plate H’ of thickness A’ =ah. He has used 
this method to obtain the curve p= p(h) for a particular problem, 


and has estimated the derivative (35) . 
A=0 


Fig. 23. 


By altering the scale by afactor ofa, i.e., by assuming 2’ —az, 
each trajectory in H may be put into one-to-one correspondence 
with a trajectory in H’ (Fig. 23). The directions and energies for 
the new trajectory are already known statistically and may be 
associated with the corresponding collisions. However, the lengths 
of the sections of the broken curve were determined statistically 
on the old scale, According to paragraph 6 this distortion may be 
compensated for by the introduction of additional weights. This, in 
fact, is the principle of the method of similar trajectories. 

In addition to the new weights w’, it is convenient to introduce 
the weights v’ with which the neutron reaches the point 2’. The 
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weights w’, correspond to the weights w, which were used to deter- 
mine the original trajectories in H and which allow for the 
absorption. It may be considered that in H,v,=@,_,butv,+o'_,, 
since v’, takes into account the distorted statistical blceotinctien 
of the path length. 

Thus, consider the first section of the brokencurve in H which 
has a length 


21 — 2 = Polo» 
where /, is chosen from the density 
p(x) = Lge ** 


(for the sake of brevity %,(E,) will be replaced by ,). For the 
broken curve in H’ the length of the first section is 


%— 2) = Blo» 
where /} is also chosen from the density 
@ (x) = Lye, 


However, instead of this statistical calculation it was assumed 
that /}=<al,. Since the density of the product ai, is 


ai=(=)e (2 ae 


it follows that the weight which must be introduced (paragraph 6) is 
equal to 


Yi 
2(4) = gell—9) Xl” 
91 (4 
Assuming that w}—w, we have 
U, = wae") Plo, 
Ww = welt) Yolo, 
The formulas for the new weights may be derived in a similar way: 
g y 
v. nat yw n-1’ @=f,ty 
where 
fp ee [Fe Fo) fot os +84 Epa) nal 


In order to determine /, one can use f,=1 and the recurrence 
formula 


j,= f,-we""" 24 (FEn—1)  n-1, 


112 Ill, APPLICATIONS IN NEUTRON PHYSICS 


Formulas for estimating the probability of transmission of 
neutrons p(h) through the plate H are given in paragraph 17. In 
order to establish the analogous formulas for H’ we note that 
when p,, > 0 we have 


a’ 


, —2 (En) 


Q,=e 


2, x, (Ex) h-z, 
By e ta Pp =(Q,)". 


Bearing this in mind we have* 


and correspondingly 


The method of similar trajectories may be used to carry out 
simultaneous calculations for a series of plates: after each colli- 
sion a series of weights is determined (forall the required «) and 
these are accumulated as the trajectories are completed. 

Suppose now that it is necessary to calculate the transmission 
of neutrons through plates having thicknesses h,, fy, ..., h,. The 
problem arises as to which is the basic plate. There are grounds 
for assuming that the most convenient procedure is to take the 
plate with the greatest thickness, so thatallthe factors « are less 
than unity. The accuracy of the calculations will then be of the 
same order of magnitude for all the plates. 


20. Simulation of Collision Density 
It is usually assumed that a large internal memory is not re- 


quired in computers used for Monte Carlo calculations since each 
trajectory is simulated independently of all others. However, if a 


*The sum of the weights of the transmitted neutrons is equal to 


= . 
fi ajtl fa atl ay 
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large memory is -available, then the collision density can be 
effectively simulated, i.e., all the states of allthe neutrons may be 
stored. These canthenbeusedto solve analytically a large number 
of similar problems using the same trajectories. This represents 
the most complete use of all the information obtained during the 
realization of the neutron trajectories. This type of calculation 
was first used by Berger et al. [78, 791 in the study of the trans- 
mission of gamma-radiation. 

Suppose that a monoenergetic beam of neutrons with initial 
energies E, and angle of incidenceé,=0 is incident on the plane 
z=0 from the region z<0. Thus, theinitialstate of each neutron 
is z,=0, py=1, E). The trajectory of each of the N neutrons will 
be constructed without allowance for absorption* or forthe bound- 
ary conditions: 

1) The coordinate of the ath scattering (nx =1,2,...) is deter- 
mined statistically: 


_ ___ a1 F 
4—%1—-F pe 


2) The new direction p, and the new energy £, are determined 
statistically as in paragraph 15; 

3) The condition for the termination of the trajectory is verified 
(it is convenient totakethis conditioninthe form F,<« wheres is 
a given number). 

All the states are recorded so that for the jth neutron we have 
the following series of states: 


ZV). po), Ey ti); 
40) wh), 4:0); 


2n, (/). Yn, U)» En, (J). 


where j=—1, 2, ..., N. These states may be regarded as an 
empirical sampling from the collision density g(z, », £) which 
represents the probability density for the scattering of a neutron 
at the point z in the direction » with an energy E. 

Since most of the quantities which are of interest in such 
problems are expressed in terms of integrals containing the 
collision density g(z, », £), it follows that the approximate deter- 
mination of these quantities can be reduced to the evaluation of 
certain sums over the recorded states. 


*If all the states are recorded the weights can be introduced later, 
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Consider the quantity p;,(4) which is the probability of trans- 
mission of neutrons through a plate 0<2z<# where the direction 
and energy of the neutrons escaping from the plate lie in certain 
fixed intervals Ap,, AE,. A knowledge of all the p,,(h) is equivalent 
to a knowledge of the spectrum of the transmitted neutrons. Con- 
sider the sum 


N 
Puhr >, 32, 4) Qn. Fn * Y). (3.10) 


j=1n=0 


In this sum: 8, (/) represents the geometrical boundary conditions 
of the problem. In our case, 8,(/)=1 when 0<2z,(/)<aAfor all 
m=0, 1, 2, ..., 2; in all other cases&,(j)=0. The weights w,(/) 
represent absorption: 


Zm21 (/)-2m 


n—1 7 
~ 35 (Em SY) Bin WD) 


w,()=[[e 
m= 
The product is equal to the probability that there will be no absorp- 
tion along the entire path from the beginning of the trajectory to 
the nth scattering. 

The function Q,(/, #) is the probability that a neutron in the nth 
state will intersect the plane z—A without undergoing any further 
scattering. It is clear that 

Zn (i) 


—85 (Ey wy a <i ,when = ih 5, 9, 


Q,0, = 


0 when ” rere a0: 


Finally, the quantity f/*(j) gives the seat classification: 


ae 1,whenp,(/) € Ap, E,(/) € AE,, 
fi N= 0 in the opposite case. 
The spectrum of neutrons reflected by the plate0 <z<A is 
calculated in a similar ae 


i , 
m7 Dy Sa, (/) (4) Qq (J. 0) fa ® (). 


=1 n=0 


All the quantities inthis formula have the same meaning as before. 
Similar formulas may be written down for the neutron flux and 
a number of other physical quantities. 
The terms corresponding to n=0 in Equation (3.10) represent 
unscattered neutrons and can occasionally be conveniently replaced 
by an analytical expression. 


3. PASSAGE OF NEUTRONS THROUGH A PLATE 115 


Instead of simulating the passage of neutrons at different angles 
of incidence 6, it is possible to confine the analysis to the case 
6) = 0 and yet consider trajectories at all possible orientations. If 
the complete states x,(/), ya(/)» Zn(/)s Ba(J)s S_n(/)» EnV) have been 
recorded, then it is easy to compute the transmission through a 
plate of given thickness / by orienting it in space at the required 
angle (Fig. 24). 


Fig. 24. 


Instead of z, andy, in Equation (3.10), it is necessary to use 
the quantities 


Ay é 
Z,==%X, sin fy) z, cos 9), 


u, =cos 9, cos %,-+ sin 6, sin 99 cos 7, 
21. A Numerical Example 


In order to compare some numerical methods of solution of the 
transport equation, a calculation was made of the transmission of 
neutrons through a beryllium plate (A=9;5,—0;6,=>,=<¢,,). The 
scattering was assumed to be isotropic inthecenter of mass sys- 
tem.* The thickness of the plate was 4 = 5 mean free paths (cf. the 
example in paragraph 12). The energy £ was replaced by the so- 
called lethargy, which is defined by z=In (££). The incident 
neutron flux density was assumed to be constant for 0<»< 1 and 
O<u< 12. 

The method of paragraph 20 was used to calculate the number 
I,, Of transmitted neutrons with directions and lethargies corre- 
sponding to the above intervals, for a unit area perpendicular to 
the direction of exit. Since the density of the computed quantity 
is given by 


*The calculations were carried out by OB, Moskalev, I.G, Krutikova, and V. A, 
Chuyanov under the general direction of E, S, Kuzetsov. 
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_hmz Lh 
T(, ps, n= f ge p, “le # +10, p, ue , 
0 


the computational formula was taken inthe form / 
{ 


N I h-z, ft ‘ 
I 1 i,k rs 
n~y YD ne Ya fa be *t (iy1— tps 
=ln=1 


where p,=0.5(»,-+-p,,,). The following five intervals (groups) of 
values of the lethargy were considered: 


O<u<38; 3u<6; 6Su<9; 9SuU< 12; IWu<co 
together with the following twenty intervals of values of p: 


Papi — By = 0.05. 


Formulas for the calculation of the trajectories.* The initial 
values were: z)==0, y=. 4%=12y. The transition from the ath 
collision to the (2-++-1)th collision was taken to be 


Zn41 = Z_— thy INT; 


a 
cos X == aa , provided that y{i-+ 73 <1; 


+e 
cos § = 2y— i: 
1+ Acos¥ 


cos 8 SS 5 
4 V1+42A-cos 6+ A? 


Png1 = Hp, COSY — COSY, (1 — p?) (1 — cos? ); 


y= U,— In (1+ 2A cos 6-4 A?) 21n(A-+ 1). 


Results (obtained with 9,000 trajectories), Figure 25 shows 
histograms of the angular distribution of transmitted neutrons for 
each of the energy groups, and also the values obtained by the 
multigroup method of solution of the transport equation [44]. 

Table 10 shows the values of the flux average with respect to 
the solid angle.** 


*Use was made of the pseudo-random numbers {y} given by Sobol! [57]. 
**The first group 0 <u < 3is"toolarge" for the multigroup method, This explains 
the discrepancy between the two results in the first group. 
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6<u=9 


9<uele}- 


Fig. 25. 
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Table 10 
Group 
Method 
Ogu<3 3qu<6 6ou<9 9cu<12 Weuctw 
Monte Carlo... 0.0494 0.189 0.230 0.250 0,337 
Multigroup ... 0.1084 0,192 0,240 0. 264 0,340 


4, SOME METHODS OF CALCULATION OF THE 
CRITICALITY OF NUCLEAR REACTORS 


22. Formulation of the Problem 


Consider a volume R containing afissile medium (for example, 
uranium), A neutron entering this region may escapefrom R with- 
out colliding with the nuclei, or it may interact with a nucleus and 
thereby become absorbed or scattered, or it may give rise to 
fission yielding v new neutrons. A large number of such interac- 
tions will lead to a change inthe average number of neutrons in R, 

The volume R is called critical if the average number of 
neutrons within R remains constant. If the number of neutrons 
decreases, then the volume is called subcritical, while if the 
number of neutrons increases, the region is referredto as super~ 
critical. 

In a subcritical region a chain reaction willnot be maintained. 
In a supercritical region the chain reaction will develop and lead 
to an explosion. Roughly speaking, the control of a reactor con- 
sists in the maintenance of the critical state. 

Usually, the active zone of a reactor, whichcontains the fissile 
material, is surrounded by a reflector which is in the form of a 
medium with low neutron absorption cross section (graphite, 
beryllium). The presence of the reflector reduces the leakage of 
neutrons from the system, and ensures that the critical state is 
reached with a smaller amount of fuel (cf. Fig. 20). Moreover, a 
reactor incorporates control rods which are made of a material 
having a highneutron absorption cross section (boron). The control 
of the reactor is carried out by introducing or removing such rods 
from the active zone. 

The effective neutron multiplication coefficient k er of a reactor 
is defined as the average number of secondary fission neutrons 
per primary fission-inducing neutron. More precisely, letn, be the 
number of all the neutrons belonging to a given generation and 
consider the fate of each of these neutrons prior to its escape 
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from the reactor, or untilthe time when it becomes absorbed (with 
or without fission). If the number of second-generation neutrons 
associated with the original n, neutrons is n,,then k ett = M/A (a 
more precise mathematical definition of kes is given in para- 
graph 24). 

It is clear that the reactor will be in a critical state when 
k eff =1. When & eff > 1 the state is supercritical and when & er <1 
it is subcritical. 


23. Simulation of Physical Trajectories 


Suppose that a finite number Ny of neutrons is placed in the 
reactor. It is desirable that the distribution shouldbe close to the 
true distribution although, in fact, it can be arbitrary. Let us 
select a time interval At which is several times larger than the 
mean lifetime T of a neutroninthereactor. A Monte Carlo deter- 
mination of the history of each of these neutrons during the time 
At will yield the number of new neutrons n,. After a_sufficiently 
large number j of such time intervals the ration,,,:n,will follow 


_ , the formula 


At 
My 41 ny pw el eff ek 

and its magnitude will be an indication of the criticality of the 

reactor. 

However, the actual realization of this type of calculation turns 
out to be quite complicated. The true distribution of neutrons in 
the reactor is in general a function of six variables, namely, the 
components of the position vector r and the components of the 
velocity v (instead of the latter the unit vector »~9/v and the 
energy E of the neutron are often used). 

A large number of neutrons is necessary in order to reflect 
the properties of this distribution correctly. 


24, Determination of the Number of Generations 


Suppose that the spectrum of fission neutrons is independent 
of the velocity of the incident fission-inducing neutron. In this 
case it is convenient to determine the criticality by determining 
the number of generations. 

Let 2, neutrons be placed in the reactor and suppose that 
these are the first-generation neutrons. A Monte Carlo determina- 
tion of the history of each of these neutrons right up to its dis- 
appearance (capture, absorption with fission, or escape) willyield 
the number of second-generation neutrons. After a sufficiently 
large number é of generations the ratio 7,,,:, will be given by 


Nyy. 2 Ny Rose, 
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In this method of calculation the simulation procedure is con- 
cerned only with the distribution of fission neutrons rather than 
with the distribution of all the neutrons in the reactor. By defini- 
tion, the true fission-neutron density (birth rate) is equal to the 
productzx() n(r). The spectrum of fission neutrons x(9) is known, 
so that it is sufficient to simulate the density n(r) which is, in 
general, a function of three variables. It is clear that in the re- 
flector» (r)=0. 

Consider now the mathematical significance of this method of 
calculation. Let g(r, 7’) be the density of the average number of 
secondary neutrons at the point r which are produced by a single 
primary neutron from the point r’. The density of secondary 
neutrons is then given by 


m(r)= fer. rym (e)ar’. (3.11) 


The transition from generation to generation is equivalent to an 
iteration of the initial density n,(r) by means of the integral 
operator (3.11). The kernel of this operation g(r, r’)is very com- 
plicated, and it is not useful to write it out analytically; it is 
approximately realized in the computation of the histories. The 
kernel is positive for the active zone. 

It is now possible to give a rigorous definition of the effective 
neutron multiplication coefficient in a reactor. The requirement 
Ny (r) = eff n,(r) is equivalent to 


f g(r. r’) ny (r’) dr’ =R egg 1, (7). 


The latter equation shows that the density n,(r) should be an eigen- 
function of the integral operator (3.11) and k egg its eigenvalue. 
Since 7, (r)>0 (in view of its physical significance), it follows that 
k eff is the largest eigenvalue of the operator (3.11). 

It is known from the theory of integral equations that, subject 
to very broad restrictions imposed on the positive kernel, the 
iteration of 9,(7) by any nonnegative function g(r) leads (with suit- 
able normalization) to the first eigenfunction 2z,(r) of the kernel, 
and the ratios 9;,,;(7)/9,(r) at each point tendto the first eigenvalue. 
This is the basis of the method of calculation given above. 

Instead of the estimator & ops n;,,/n,, Hammersley and Morton 
[148] have suggested the use of a maximum likelihood method. 
Thus, if it is considered that when i—s_ the ratio has already 
““settled down,’’ while the count of the generations is continued up 
to i=1/, then 

ee ni—n, 


eff a— nm 
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where n=n,-++2,,,-+ ... +2, The corresponding estimate of the 
variance is of the form 


~ 2 
| 
Di ett * (2=*) Fear 


where vy is the average number of neutrons per fission. 


25. The Method of Moments 


This method involves the construction of a finite matrix which 
approximates to the integral operator (3.11), and the determination 
of its maximum eigenvalue &, whichis approximately equal to k fs. 

We shall now explain how anintegral operator may be approxi- 
mated by a matrix. 

Suppose we have a complete set of functions /,(r), f,(r). ...- 
Any function 2(r) can then be expandedinterms of these functions: 


n(r)= cf (r)+ Cofo(r)+ «-- 


Thus, to each function z(r) there corresponds an infinite vector 
{CyCo +++} The constants ¢c,, c,, ... are called the Fourier co- 
efficients of the function n(r) with respect to the complete system 


LO, fo”, eece 
Suppose further that the integral operator K in 


n(y= f K(r, r’)n(r’) ar’ 
transforms the functions /,(r) intof;(r) so that 


AO=f[Ke ry fear’, 
and let 
A:OM=anfi)+epfh@)+ .. | i=1, 2... 


It is clear that the infinite matrix ||a,,|| is equivalent to the 
integral operator K since it may be used to find the vector 
{¢), ¢y,...} Corresponding to the function n(r) from the vector 


(C1, Co, 2}: 
C; = Ay jC, Agilo+ ---s i=1, 2,... 


If only a finite number m of thecoordinates is retained, so that 
the vector becomes {c¢,,..., ¢,,}, then one can consider the finite 
matrix ||a,;||. 1<i, j/<m which transforms the vector {c;, .... Cm! 
into the vector {c,, .... ¢,,}. This is, infact, the finite matrix which 
approximates the integral operator XK. It transforms functions of 
the form 
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a(ry= Cf, ()+ .-- Heat OY) 
into the functions 


AM=AhOt --. Henin: 
so that 


Cp Ay Cy vee Omg =A, 2,0.-5 me, 


Gel’fand et al. [19], who were the first to use the method of 
moments, have calculated a matrix which approximates the oper- 
ator conjugate to (3.11) (the eigenvalues of conjugate operators 
are equal), Consider the history of a neutronborn at the point ri‘). 
Let v(m) be the number of secondary neutrons produced by the 


latter neutron at the end of the /thhistory at the point wy (in the 


case of eScape or capture y=0). Having accumulated N such 
histories beginning at the same point rj we find the approximate 
values of the integrals 


N 
2 1 
F(rgy=f g(r. r{s)) f,¢ army 3 ¥ (08) f; (7): (3.12) 


They are evaluated simultaneously for all i=1,2,..., m. Having 
selected p such points r)(s=1,2,... »P) where p~> m, and having 
calculated for each of them the value of Fi (ri), we choose the matrix 
a, so that 


mt 
24 4 P11, 2 Ser05, MNS 
Fueey~ & a, if j (7%). s=1, 2,...5 p. 


The determination of a,, may be carried out, for example, by the 
method of least squares, so that 


The variances D,, are estimated from the same case histories. The 
largest eigenvalue of |ja;;|| is then formed by purely algebraic 
methods. 


m 


Bat (”)— ae ets(et)| =a 


m 


26. Normalization of the Number of Neutrons 


The following difficulty may be encountered in the realization 
of the methods described in paragraphs 23 and 24. When & eg, is 
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appreciably greater than unity, the number of neutrons will rapidly 
increase and may overload the computer store before the density 
profile is established. Conversely, if keff ismuchless than unity, 
the number of neutrons may decrease rapidly, or may even become 
zero. 

This degeneracy is not catastrophic when it is required to 
determine the value of some reactor parameter which will ensure 
criticality. It indicates the direction in which the parameter should 
be varied. However, the possibility of interpolation between the 
calculated values of keff is then lost. Bouquet et al. [84], using 
the method of paragraph 24, avoided the above degeneracy by 
introducing a spatial grid which enabled them to normalize the 
number of neutrons in each generation. The active zone was divided 
into elements (0.1 of a free pathin diameter), and all the neutrons 
produced in a given element were placed atits center. The initial 
generation density n,(r) was replaced by the numbers of neutrons 
produced in these elements ny, mp, ..., ™, 80 that my,—--ay-+ ... + 
ny,,== M. When the number of secondary neutrons 1g;, My), «. «, My, had 
been found, the required ratio was calculated from 


No, + Mog + eee + tos 
M . 


This was then followed by the normalization of the neutrons: the 
new n,, were found by multiplying the old n,, by the reciprocal of 
the above ratio, i.e., M/(m.,+ ... —+-m,)- 

In this method of calculation the numbers x, turn out to be 
fractions and play the role of weights (masses). It is possible to 
construct a single trajectory for eachofthe elements and multiply 
the weights of the next-generation neutrons by ,,. For those ele- 
ments in which m—1<tn,,< m, it is moreconvenient to initiate m 
histories and multiply the weights of the resulting neutrons by 
n,{m, Where m is aninteger andthe value of ¢ is chosen in accord- 
ance with the value of M, 


27. Application of Weights 


The advantages of the method of counting of generations, as 
compared with the simulation of physical trajectories, lie in the 
fact that ordinary trajectories can be considered instead of 
branching trajectories. It is thus possible to use the various 
methods of introducing statistical weights which were discussed 
above. In particular, the weights allowing for absorption (para- 
graph 14) and the weights discussed atthe end of paragraph 26 are 
often employed. The statistical weights discussed in paragraph5, 
which represent the escape of neutrons, are less frequently used. 
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More complicated formulas [analogous to (3.9)1canbeused simul- 
taneously with the statisticalweights. For example, a simple count 
of the descendants of a neutron, i.e., thenumber of neutrons avail- 
able at the end of a case history, may be replaced by 


where v, is the statistical weight of a neutron approaching the nth 
collision. Similarly, the integrals in (3.12) may be estimated with 
the aid of the formula 
NY 
~ v Zy (E,) 
FRuee)~ aL Lee sey f(r) 


j=l a=l1 


- 


in whichw,=1. 


28. Determination of the Critical Values of Reactor 
Parameters 


The methods of paragraphs 23-25 may beusedto calculate & eff 
for a givenregionR, However, a reactor model frequently contains 
some parameter ~ whose critical value must bedetermined. This 
parameter may be, for example, the concentration of uranium in 
the active zone, the thickness of the reflector, and so on, 

The critical value of x is the value which corresponds to & eff =1. 
Thus, in order to determine x ,,; it is necessary to determine a 
number of values of the function & ef = F (4) andcarry out an inter- 
polation leading to F(% opi)=1. 

As a rule, the same histories may beused to determine several 
values of the functions F(z). For example, let « be the thickness of 
the reflector, and consider a number of valueszx=%4<%<...<%, 
(Fig. 26). During the determination of the history let us record the 
maximum layer number reached by the neutron. Ifthe neutron has 
reached layer number / and then returnedtothe active zone giving 
rise to a fission, then the fission neutrons are not included in the 
next generation forx=x,..., %j;;- This method is completely 
analogous to that described in paragraph 18. 


29. Calculation of the Criticality on the One-Velocity 
Approximation 


The so-called one-velocity or one-group approximation is 
occasionally used, mainly with fast neutron reactors. On this 
approximation the energy of all the neutrons is assumed to be 
constant. We shall consider a further method of calculating the 
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Fig. 26. 


critical parameters within the framework of the one-velocity 
approximation (the methods of paragraphs 23-25 will still hold). 

In the present case the criticality condition may be given in the 
form of an integral equation which is usually known as the Peierls 
equation. In the case of isotropic scattering this equation reads 


n(r)=h J: Bir’) K(r, r’)n(r’) dr’, (3.13) 
R 


where 
P=E,-, a=, 
Kr, rye Sees »|r—r'|-*, 


and the integral f ads is evaluated over the straight line connecting 
the points rand r’: 
[r’-r| 


foas= f a(r—+t-ws)ds, w= at. 
0 


The criticality condition is of the form \4,;—1 where i, is the 
smallest eigenvalue of the integral equation (3.13), i.e.,1/, is the 
largest eigenvalue of the integral operator px. The region R is 
subcritical or supercritical when },>1 or 4,<1, respectively. 
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Thus, the determination of the criticality in the one-velocity 
approximation is reduced to the determination of the smallest 
eigenvalue of the integral equation (3.13). 

However, if the geometrical structure of Ris complicated, then 
it is very difficult to find 4, by ordinary numerical methods. 

Vladimirov [16] has suggested the determination of }, by the 
Monte Carlo method. Such calculations have been successfully 
completed by Vladimirov and Sobol’ [18, 59]. Since the methods 
employed were concerned with the solution of arbitrary integral 
equations, we shall not discuss them in detail. We shall merely 
note that the law for the construction of random trajectories which 
was realized by Vladimirov [18] is equivalent tothe use of statis- 
tical weights of the form 


a-1 
- a(r wo, 8) 4 
{ (n-1t+®q—15) | 8 (ra) 


a (Fn) 


Wy = Wn-1 a 


which represent both the escape of neutrons and the probabilistic 
method of determination of the interactions,* 

Kellogg’s method of successive approximations was used inthe 
above papers to calculate },. A variational method can also be 
used. In either case, it is necessary to evaluate certain compli- 
cated integrals and this can be done by the Monte Carlo method. 


*Fission and scattered neutrons are equivalent since their energies are equal and 
the scattering is assumed to be isotropic, ; 


Chapter IV 


Application of the Monte Carlo Method to the 
Investigation of Mass Service or Congestion 
Processes, Including Queueing 


1, GENERAL INFORMATION ON MASS-SERVICE 
PROBLEMS 


In the course of the last decade practical needs have stimulated 
intensive study of a specific class of problems concerning mass 
service. A typical feature of such problems, which arise most 
often in particle physics, telephony, production scheduling, plan- 
ning, automatic control of complex assemblies, etc., is the 
presence of a server system with which requests for service 
(calls) are placed at random instants of time. The server system 
has lines (channels) accomplishing the set of operations implied 
in the word ‘‘service,’’ An automatic refueling station can be 
considered as an example of a server system, Requests for 
service (calls) arise when cars arrive at the station for refueling. 
The lines are the individual gasoline pumps by means of which 
attendants carry out the refueling of cars, ensuring independently 
the full cycle of operations connected with serving a call. 

The object of mass-service theory is the study of the time- 
patterns arising in the process of serving a stream of calls forming 
an input in the system. This input stream is a sequence of calls 
with definite patterns of arrival times, 

If all calls of a given stream have, from the point of view of 
service, equal rights, the only question which arises at a given 
instant of time is whether the arrival of a call takes place or 
not, Such streams, which are called streams of homogeneous 
events, have been thoroughly studied and admit of elegant and 
convenient mathematical description. 

The solution of practical problems sometimes requires that 
the inhomogeneity of the stream of events be taken into account. 
The use of streams of homogeneous events as mathematical models 
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to describe the real stream of calls can prove in such cases 
inadequate, Some such examples will be specially discussed 
below. 

The general ideas and the methods of mass-service theory 
are used to formulate and to solve important applied problems, 
For this reason much work has been devoted to obtaining analytical 
relationships to enable service-performance indices to be evalu- 
ated on the basis of known parameters of the stream of calls and 
the characteristics of the server system. 

However, the relevant results obtained in the literature con- 
cern mainly comparatively simple cases, both with respect to the 
structure of the stream of calls andwithrespect to the properties 
of the server. Therefore, the analytical apparatus available today 
cannot fully satisfy the growing demands of practice, 

Investigations have shown that many problems connected with 
mass service can be solved by having recourse to the Monte Carlo 
method, 

A comparison of the potentialities of the analytical methods 
known today in the theory of mass service and the Monte Carlo 
method from the point of view of evaluating service~performance 
indices suggests the following considerations, 

Formulas and equations for evaluating service-performance 
indices (the probability of loss, the expectation of queue length, 
etc.) are, as a rule, of an asymptotic nature and are valid for 
instants of time sufficiently far removed from the beginning of 
service. Such solutions do not always prove satisfactory from a 
practical point of view. The fact is that real mass-service proc- 
esses exhibit transient behavior from the state with full availa- 
bility of the lines to the stationary state described by the asymp= 
totic formulas, The duration of the transient state is far from 
always negligibly small and is sometimes comparable with the 
duration of the service cycle typical of the system studied, 

The Monte Carlo method enables us to obtain estimates for 
the service-performance indices for arbitrary intervals of time, 
including intervals during which the behavior is transient. 

Existing analytical methods for solving problems of mass- 
service theory proceed from very restrictive assumptions on the 
distribution laws of the streams of homogeneous events repre- 
sentative of the stream of calls: the great majority of results 
concern the simple stream (the Poisson stream), The Monte 
Carlo method enables us to obtain an effective solution of the 
problem under materially wider assumptions on the nature of the 
stream of calls, The so-called streams with limited after-effect, 
which are defined below, prove particularly convenient in this 
respect, 
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Similar remarks can be made in connection with the structure 
of the server system, The class of such systems that lend them- 
selves to investigation by the Monte Carlo methodis substantially 
more general than the set of systems encompassed by known 
analytical methods, 

In comparison with analytical methods, the Monte Carlo method 
enables us to characterize with greater completeness the depend- 
ence of service-performance indices on the parameters of the 
stream of calls and of the server system, This is explained by 
the fact that in solving problems in mass-service theory by the 
Monte Carlo method we can use more extensive information on the 
process than is usually possible when analytical methods are 
employed. For example, for systems with lossthe usual formulas 
of mass-service theory give only the mean value of the fraction of 
losses, In this case, the Monte Carlo method enables us to obtain 
an estimate not only of the meanfractionof losses, but also of oe 
parameter of the distribution law of this fraction, 

Multiphase systems and multiphase mass=-service processes 
play an important role in practice. To this date no satisfactory 
analytical methods exist for investigating multiphase systems, The 
Monte Carlo method enables us to obtain the solution of problems 
for multiphase systems under very general assumptions on their 
structure, 

The essence of the Monte Carlo method in its applications to 
mass-service problems consists in the following. By means of 
special algorithms, samples of the assigned streams of homo- 
geneous events are generated and models of the functioning 
processes of the server system are constructed, These algorithms 
are used for repeatedly reproducing samples of the random 
service process under fixed conditions. Information on the process, 
accumulated by reproducing a set of samples, is subjected to 
statistical processing for estimating service performance indices, 
This chapter gives a brief account of those features of streams of 
calls and server systems which are needed for the solution of 
problems by the Monte Carlo method; in addition, methods of 
generating samples of random streams of calls and methods of 
constructing algorithms to simulate the service process are 
considered, 


2, MATHEMATICAL DESCRIPTION OF AN INPUT 
CONSISTING OF A STREAM OF CALLS 
REQUIRING SERVICE 


We shall begin our analysis ofthe problem of the mathematical 
description of a stream of calls with the simplest case in which 
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the stream can be assumed to be a stream of homogeneous 
events, 

Each event of the stream is characterized in this case by the 
instant of time ¢,; at which it occurs, To describe a stream of 
homogeneous events as a random process it suffices to assign 
the distribution law characterizing the sequence of random 
quantities t,, ty, 20.4 tmy ee 

It is often advantageous to consider, instead of the quantities 
bis toy wees tms eee, the random quantities §, &,..+s ms; +++, Which 
are the durations of the time intervals between the successive 
instants of time ¢,, 


edie (4.1) 


Below, we shall consider only continuous random quantities 
—, and shall characterize their occurrence by means of the joint 
probability density function f (z,; 293 ..-; Z,). 

Certain special classes of streams of homogeneous events play 
an important role in the applications, One isthe class of streams 
with limited after-effect, 

A random stream of homogeneous events is called a stream 
with limited after-effect if the random quantities £, are inde- 


pendent, 
It is evident that for streams with limited after-effect the 
joint probability-density function f(z, 2, ..., 2,) can be written 


in the form 
F(Z» Zqn ve es Z_) == Fy (2) fo (Zp) 0 » Fy (Zp) (4.2) 


The functions /;(z)) (i> 1) are conditional probability density 
functions of the quantities §, with the condition that at the initial 
instant of the interval é, (i> 1) a call has arrived, In contrast to 
this, the function /,(z,) is an unconditional probability density 
function, since no assumptions whatsoever are made asto whether 
at the initial instant of time a call has or has not arrived. 

Of great theoretical and practical interest are the so-called 
stationary streams, for which the probability structure does not 
depend on time, 

More precisely, a stream of homogeneous events is called 
stationary if the probability of the occurrence of k events during 
the interval of time (¢,, t,-+4) is independent of t, and depends only 
on ¢ and &, 
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Stationary streams with limited after-effect satisfy the relation 


f@O=f,@)= .-. =f,D=—f (). (4.3) 


This means that, for i>1, the intervals §, are identically 
distributed, An input consisting of such a stream is sometimes 
described as a general recurrent or general independent input. 

Let us consider the expectation Mofthe random quantity §, for 
i>l 


M= f ef @az. 
0 


The quantity M is the mean duration of the interval between 
successive calls, 

It can easily be seen that for recurrent streams, i.e., stationary 
streams with limited after-effect, the quantity 


1 


=a (4.4) 


is the mean number of events occurring per unit time. The 
parameter A is called the stream density. 

As an example of a recurrent stream consider the stream that 
has uniform distribution of the intervals of time between calls, In 
this case, the probability density function f(z) has the form 


fQ=Z O<z<b). (4.5) 


Since the expectation of the quantity § is equal to }/2, then the 
stream density of the stream assigned by the probability density 
function (4.5) is equal to 


2. 
1= 5 le (4.6) 


Another example of a recurrent streamisthe so-called simple 
stream (the Poisson stream or stream without after-effect), For 
a simple stream we have 

Ff (2)=he-®, (4.7) 


The parameter } is the density of the simple stream, 
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Recurrent streams satisfy the following relation (Palm’s 
formula, see [10]) connecting the probability density functions 
f(z) and f (2): 


jie — f ft) au]. (4.8) 
0 


By using (4.8) we can obtain the probability density function 
f, (2) for various recurrent streams, For example, for a stream 
with uniform distribution of the intervals 


or 
—2z, d 
fey 2G =iX(1 —F21). (4.9) 


For a Poisson stream we have 


niey=a|1 =r feral 


or 
FS (2) = he-™, (4,10) 


It follows from (4,10) that in the case of a Poisson stream we 
have f,(z,)= f(z) As canbe easily seen fromthe previous example, 
this is not true in general, 

We observe that the distribution lawofa stationary stream can 
be assigned in the form of a distributionlaw P(&, of the number & 
of events occurring during an interval oftime of duration ¢, For a 
Poisson stream P(k, f)is equal to 


k 
Pk, p=“ em, (4.11) 


i.e., it has the form of the Poisson law with parameter }f, 

We shall consider, as an example of anonstationary stream, a 
Poisson stream with variable parameter, The distribution law 
P(k, ty, t)of the number of events occurring during the interval of 
time (f, 44-+¢) has for such a type of stream the form 


Pb, ty p= AG OF pawn, (4,12) 


2. MATHEMATICAL DESCRIPTION OF AN INPUT 133 


The quantity A(t), f) is the expectation of the number of events 
occurring during the interval oftime (f,,4+4. The function A(f), ¢) 
can be represented in the form 


fott 
A (fy n= f h(u) dus (4,13) 
ty 


The quantity \(f,) is the instantaneous stream density at the 
instant f, and A(f), £) /f is the mean stream density in the interval 
of time (¢,, f+4). 

It can be shown that the probability density function /,(z,)for the 
first interval =, has the form 


A@d=A,0, ze *?™, (4.14) 
and, for an arbitrary interval, the form 
o(t, y= Az(t, zye *?, (4.15) 


We have considered thus far so-called ordinary streams of 
homogeneous events, A streamiscalledordinary ifthe probability 
(t,t) of the occurrence oftwo or more events during an interval of 
time (¢,, 4, +7) for arbitrary f is a small quantity in comparison 
with ¢, i.e., if 


lim 
t>0 

In applications that are connected with servicing grouped calls 
we sometimes encounter problems which give rise to clusters of 
events, i.e., nonordinary streams of homogeneous events have 
sometimes to be considered. In order to describe nonordinary 
streams, we must assign, in addition to the instants /,, the dis- 
tribution of the number of calls arriving at each of the instants of 
time ¢;, Inthe particular case when the number of events occurring 
is a random quantity independent of the instants f,, it suffices to 
assign the probability p, that at an arbitrary instant of time /; 
exactly k events occur, 

In solving certain very important applied problems the real 
streams of calls cannot be reduced unconditionally to streams of 
homogeneous events, To obtain the solution of such problems to 
an accuracy sufficient in practice, the inhomogeneity of the calls 
of the stream cannot be ignored, 

For example, if we consider the operation of a fleet of motor 
vehicles as a mass-service transport system, we must take into 
account those properties of the calls that substantially affect 
transport operating conditions, Such properties may comprise (in 
addition to the required initial instant of service): the coordinates 


(lo 
ST = 0. 
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of the loading and unloading posts, which determine the duration 
of the idle and the productive runs; the load characteristics (with 
respect to weight, size, etc.), which determine the types and 
number of vehicles required for serving calls; the characteristics 
of delivery terms, etc. 

Similar features are also of importance when other real 
processes connected with mass service are considered, 

Therefore, each call must be characterized inthe general case 
both by the instant of its arrival in the server system, ¢, and by a 
series of parameters a,, %, ..., 4, The modern apparatus of 
probability theory provides the means for a mathematical de- 
scription of random streams of calls in this general case. 

However, this is not required by the concrete problems that 
are considered in the present chapter, Therefore, we shall re- 
strict outselves to such particular cases as can be met in the 
solution of a number of practical problems. 

We shall represent the random stream of the instants ¢, of the 
occurrence of calls in the server system as a stream of homo- 
geneous events, The remaining random parameters of the calls 
O15 Oy vee,% Can be assigned by means of a conditional multi- 
variate distribution law 9(a;, a, ..., @,/t;). 

Since such a method of assigning a stream of calls is com- 
paratively complicated, it is advantageous sometimes to proceed 
to further simplifications, These simplifications usually consist in 
using for the instants ¢, streams of homogeneous events with 
limited after-effect, and assuming that some of the parameters 
@1, Gy, ...,%, are independent of ¢, and of each other. In spite of 
these important restrictions, such methods of assigning a stream 
of calls prove suitable for the solution of many practical problems, 


3. MASS-SERVICE SYSTEMS 


The solution of practical problems requires aninvestigation of 
the processes of functioning of various systems and aggregates 
connected with the mass servicing of calls. It is evident that all 
the variety of features characteristic of real processes cannot be 
taken into account in their mathematical description, We need only 
to take into account the basic factors that determine the course of 
the process, and can neglect second-order effects that are unim- 
portant from the point of view ofthe problem formulated, In other 
words, the mathematical description of the process of functioning 
of a real system must be based on some idealized scheme repro- 
ducing the basic features of the real process, 

A mass-service system consists in general of 2 lines capable 
of simultaneously and independently serving the calls. At an 
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arbitrary instant of time a line is foundin one of two states, either 
free or busy. 

Let us assume that at some instant of time a call arrives at the 
server system. If at this instant of time there are free lines then 
the call is admitted to service. Inthe opposite case, i.e., when all 
lines are busy, the call remains in the system for a certain time 
(let t, be the permanence time ofa call in the system) as aclaimant 
to service. Within an interval of time *, the call must be admitted 
to service, otherwise it is assumed to be lost (rejected). 

Depending on the value of t,, mass-service systems fall into 
three classes differing both inthe structure of the service process 
and in the mathematical formulation of relevant problems, Ift,=0, 
a call arriving at a certain time is either immediately admitted to 
service (if there are free lines) or rejected (if all lines are busy). 
Such mass-serice systems are called loss systems. The service- 
performance indices usually considered for loss systems are the 
probability of loss, the mean number of calls lost over a given 
period of time, etc. 

In the other limit case when tp = co, calls arriving at the system 
are never lost, but (if all lines are busy) wait ina queue until they are 
admitted to service. Such mass-service systems are called delay 
systems. Service-performance indices considered in this case can 
be the mean waiting time of a call, the mean queue length, etc. 

Finally, if 0<t),< oo, a call that finds all lines busy at the 
instant of its arrival waits for a time 1, in a queue, and when this 
time has elapsed (without any line being made available to it) the 
call is rejected. Such mass-~service systems are called mixed 
systems or combined loss and delay systems. The service- 
performance indices considered in this case are the probability 
characteristics of both the number of rejections and the waiting 
time as well as, sometimes, more complicated indices that take 
into account both these aspects of service performance. 

To characterize the properties of a server system we must 
assign, in addition to the timetp, alsothe time t,, namely, the time 
needed to serve a Call or, in other words, the service time of a 
line. A call admitted to service occupies one of the lines for a 
time tt); as soon as this time has elapsed the line is free and can 
proceed to serve another call, 

The quantities t and t, are usually considered to be random 
quantities with assigned distribution laws (or else with one assigned 
joint distribution law), The assumption can sometimes be made 
that one of them or both are fixed quantities. 

Let us proceed to consider various common versions of line- 
occupancy schedule. If in the mass-service system the incoming 
calls are queued, then the lines becoming free are immediately 
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occupied in the order of their becoming free. In the case when 
there is no queue of calls andthere are free lines, an incoming call 
may occupy one of the free lines according to special rules. The 
following rules are most often used in practice. 

1. The lines are occupied according to their serial numbers. 
A line with a large serialnumber cannot be enlisted for service to 
a call if there is a free line with a smaller serial number. 

2. The lines are occupied according to the order in which they 
become free. As a line becomes free it is entered in a queue and 
cannot be enlisted for service to a call as long as there are free 
lines that have become free earlier. 

3. The lines are occupied in random order according to as- 
signed probabilities. 

If at the instant of arrival of the next call there are & free 
lines, the probabilities p,,, p,,,..+» Pry of occupying the r,th, 
roth, ..., 7,th line respectively must be assigned. 

Similar assumptions can also be made concerning the order of 
admission to service for calls that are found in a queue. For ex- 
ample, it is natural to use the following rules or ‘‘queue disciplines’’: 

1. Calls are admitted to service in their queueing order. As a 
line becomes free, it proceeds to serve the call which arrived 
first in the system. 

2. Calls are admitted to service in the order of priority of 
their possible rejection. As a line becomes free, it admits to 
service the call that has used up the largest fractions of its 1, and 
which would be rejected the soonest if it were not admitted to 
service. We observe that the second rule coincides with the first 
one when «, is constant. 

3. Calls are admitted to service in a random order according 
to assigned probabilities. If at the instant of a line becoming free 
there are m calls inthe queue, thenwe must know the probabilities 
Pr» Pas +++s Pr, Of admitting to service calls with serial numbers 
Tis Toy seo, 7, VeSpectively. 

The description of versions of elementary idealized functioning 
schemes of mass-service systems can be restricted to the infor- 
mation above. In practice, other similar setups can also be met, 
which the reader can reduce to idealized form by analogy. 

Let us proceed to consider distinctive features that are typical 
of more complicated nonelementary functioning schemes of mass- 
service systems. 

Real mass-service processes are often multiphase, the server 
system consisting of a certain number of subsystems operating in 
tandem. As a rule, the following service discipline is usually ob- 
served: the next subsystem can proceed to servea call only when 
service at the previous phase has been carried through. 
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We sometimes find processes characterized by the fact that 
all calls (including those rejected at the previous phase) are ad- 
mitted to the following phase of service. Asa rule, however, a call 
that has been rejected at one phase is eliminated from any 
further service. 

We can consider, as an elementary example of multiphase 
service, the serving of customersinashop. In the first phase, the 
customer is served by the assistant at the counter who demon- 
strates the goods and prepares the bill. In the second phase the 
customer pays the cashier according to the bill. Only after this 
does the customer proceed to the third phase, the checking-and- 
pickup department. 

The above-mentioned rule, according to which the service 
operations of various phases are carried out in sequence, is not 
always observed in real multiphase service systems. Sometimes 
these operations are completely or partially simultaneous in time. 

A further generalization of elementary mass-service schemes 
is provided by the following considerations. 

When the inhomogeneity of the incoming callsisto be reckoned 
with, the distribution laws of the parameters +, and t, and of other 
characteristics of the server system are usually found to depend 
on quantities a,, o,, ..-, a,, describing the inhomogeneity of the 
calls (see the end of the previous section), or on their probability 
characteristics. This generalization is not one of principle, in 
that, strictly speaking, the case considered may be reduced to the 
elementary one; it does, however, substantially affect the pro- 
cedure of solution of problems. 

In just the same manner it may be found that the probabilities 
determining the order of enlisting lines andthe order of admitting 
calls to service do not remain constant but depend on the parame- 
terS Tp, t). Oy» 4, +++,a, OF else on their probability character- 
istics. 

It is sometimes advantageous to consider instead of two possible 
states of a line—free or busy—a greater number of possible states. 
For example: a line is free and ready to serve calls, a line is 
busy, a line is free but not ready to serve calls, etc. 

The possibility of wastage not connected with the occupancy 
conditions of the lines must be reckoned with in some mass- 
service systems. In this case the probability characteristics of 
the number of rejections or losses will depend substantially on 
the probability g of occurrence of wastage, which in the general 
case is a function of the parameters tp, t, %, %y «++, Zp, etc. 

Finally, an important factor accompanying the functioning 
processes of many real mass-service systems is the reliability 
of the apparatus and equipment. 
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From the point of view of reliability, at a certain instant of 
time a line can inthe simplest case either be in good working order 
or be in disrepair. The reliability of a line may be characterized 
by the probability of reliable operation R= R(t), whichdepends on 
time. In certain cases, a line put out of operation quits the service 
process; in other cases it may be put again into service after a 
time t,. The repair time t, is usually assumed to be a random 
quantity with assigned distribution law. In generalt, may depend 
on parameters of the flow and of the server system. 

The fate of a call during whose service breakdown of a line 
occurs deserves some comment. The most common assumptions 
in this connection are the following: that the call is rejected; that 
the call remains in the system (with a total permanence time not 
greater than t,) as a claimant to service outside the queue; that 
the call is enlisted in the queue, etc. 

We have enumerated here only the main generalizations of 
elementary service schemes, such as are met in investigating 
real processes. These generalizations enable us to encompass a 
series of very important practical problems. When recourse is 
made to the Monte Carlo method, these generalizations do not 
hinder the effective solution of problems but only lead to some 
complication in the algorithms used to simulate the system. 


4. THE GENERATION OF RANDOM STREAMS 
OF CALLS 


An essential element of the investigation of mass-service 
processes by the Monte Carlo method is the generation of random 
streams of calls. Simplicity and compactness of the algorithms 
used for generating streams of calls are most often the factor that 
decides the actual feasibility of an investigation to be undertaken. 

We shall begin our account of methods of generating streams 
of calls with the simple case in which the calls form a stream of 
homogeneous events. 

The joint distribution law f(z, 2), ..., z,) of the random quanti- 
ties &, f&,..«» &%, i.e., Of the intervals of time between successive 
instants of occurrence of calls (4.1), will be assumed to be given. 
In order to obtain a sample stream of homogeneous events /,, 
ty, «+e, ty, We must generate a sample z,, z.,..., 2, Of the k-di- 
mensional random vector {,, &,..-, & and evaluate ¢, according 
to (4.1). Methods of generating random vectors are considered in 
Chapter VII. As is well-known, this operation proves very cumber- 
some if & is large. This fact substantially restricts the use of 
streams of homogeneous events of general type in the solution of 
problems of mass-service theory by the Monte Carlo method. 
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The procedure of generating samples of streams of homo- 
geneous events is considerably simplified in the case of recurrent 
streams. 

Let a recurrent ordinary stream beassignedby the probability 
density f(z) (4.3). The probability density /,(z,) for the first 
interval z, will be found from (4.8). Now, according to the rules 
considered in Chapter VII, we can generate a random number 2, 
corresponding to the probability density f,(z,)and obtainthe instant 
of occurrence of the first call ¢;—z, Then we generate a series 
of random numbers corresponding to the probability density f(z), 
and evaluate by means of the relation (4.1) the quantities ¢,, ¢,, 
..-, ¢,. The procedure described is illustrated below by a number 
of examples. The streams of homogeneous events considered as 
examples can be used as typical mathematical schemes for an 
approximate description of various streams met in the solution of 
practical problems. 

Below, we shall assume to have available random numbers R, 
with uniform distribution in the interval (0, 1). As is well-known 
(see Chapter VID), to obtain random numbers x; with probability 
density f (x), we must solve with respect to x, the equation 


wt 


f s@ax=R, (4.16) 


Relation (4.16) will be used, in addition to other methods, for 
generating streams of homogeneous events. 

The simple stream. The probability density function / (z) of the 
intervals between calls, §, for i>1, has, for the simple stream, 
the form 


f(z) =he-*, (4.7) 


Owing to (4.10) the same expression is validfor the probability 
density functions, (z,) of the first interval §,. Therefore, the genera- 
tion of samples of a simple stream of homogeneous events can be 
reduced to the particular case of generating a sequence of inde- 
pendent random numbers having the exponential distribution (4.7). 

To do this we use relation (4.16) 

2 


h e—* dz=R, 
! 


or 
1—e"*i = R,. (4.17) 
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By solving equation (4.17) with respect to z, we obtain 


z=—1ing—R). (4.18) 

Then, to obtain the corresponding sequence of the instants of 
arrival of calls t,, fp, ..., ty, +++, We use (4.1), 

We observe that, to find the values of z,; in accordance with 
(4.18), a comparatively large number of operations on general- 
purpose electronic digital computers must be performed. This is 
explained by the fact that the evaluation of the logarithms by 
general-purpose digital computers is carried out by means of 
standard programs based on power-series expansions. There- 
fore, to generate a sequence of random numbers with exponential 
distribution law, recourse is often had to approximate methods 
(Chapter VII). 

In the particular case ofa simple stream certain other methods 
of generating samples can also be recommended, For example, 
for small values of 4(in practice for } < 0.5) the following approxi- 
mate method proves advantageous. Its key idea is based on 
simulating the conditions of the corresponding limit theorem. 

The units of time (e.g., minutes) usedinthis problem are given 
serial numbers 1, 2, ..., m, ... Let us split each unit of time 
into equal parts of length <= 6/,, where O0<.9<4 <1 and 2/4 is an 
integer. Within each unit of time we enumerate the intervals 
obtained: 1, 2,...,/,..., 4/0. Thenweuse the following random- 
test procedure, From a set of random numbers with uniform dis- 
tribution over the interval (0, 1)weselecta random number R; and 
test the inequality 


R, <8. (4.19) 


If the inequality (4.19) is satisfied, we assume that a call has 
arrived at the instant of time 


t,=(m —1)+-(—1)s + ZR, (4.20) 


If the inequality (4.19) is not satisfied, we assume that no call 
has arrived, and proceed to the next random number R;,;. 

Beginning with the zero instant of time (m—=1; j/—1), we check, 
by means of the above-described tests, whether acall has arrived 
during the first interval (of duration <) ofthe first unit of time. If 
the inequality (4.19) is satisfied, the instant of arrival of the first 
call is 


== R, 
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If the inequality (4.19) is not satisfied, we proceed to the second 
interval (of duration z) of the first unit of time (#—=—1; j/—2), etc., 
until all intervals of the first unit oftime are exhausted. Then we 
proceed to the second unit of time (m=2; j= 1). etc. 

In order to ensure a closer compliance of the distribution law 
of the stream thus formed with the Poisson distribution law, we 
must reduce §. An obstacletothis isthe increase of the volume of 
computations which, for very small:=—4/), (practically :<0.05), 
makes the procedure no less cumbersome than the calculation of 
z, according to (4.18). 

In the solution of problems connected with mass service of 
concrete realizations of the simple stream, the method most 
acceptable in practice for generating samples is usually that 
considered in Chapter VII, which is basedona piecewise approxi- 
mation to f(z). 

Stream with uniform distribution of the intervals between calls. 
The probability density function f(z) forthe intervals :,,for i> 1, 
for such a recurrent stream has the form (4.5) 


f@=t O<z<m, 


and the probability density function/,(z,) of the first interval §, has 
the form (4.9) 


26 — dr 
f@= C5 =) =i (1 —3%): 
where 
2 


k=. 


0 
We observe that the expectation of the duration of the first 
interval is 
b 
M g => 3 . 


The procedure for generating samples of a stream with uniform 
distribution of the intervals between calls reduces to the following. 
To obtain values of the first interval z, we use relation (4.16) 


a 
2 
-f (b—z)dz=R,. 
0 
Then 
2,=6(1—V1—R)), (4.21) 


where R,; are random numbers with uniform distribution over the 
interval (0, 1). 
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The expenditure of a comparatively large number of machine 
operations to calculate expression (4.21) is of little importance, 
since this is to be donevery rarely, namely, once for each sample 
of the stream. 

Random numbers having uniform distribution over the interval 
(0, 5) are used as possible values of z, (i> 1). It is evident that 
they can be obtained by means of the transformation 


2 = ORy. (4.22) 


Samples of a stream with uniform distribution of the intervals 
between calls are thus simply and conveniently obtainedby means 
of general-purpose electronic digital computers, 

Erlangian streams. An Erlangian stream of order m is an 
ordinary recurrent stream for which the probability density func- 
tion of the intervals between calls §, for ¢>1 has the form 


fO= Goyer te. (4.23) 


It can be shown that the density of an Erlangian stream is 
equal to 


i= 


m* 


(4.24) 


The use of approximate methods considered in Chapter VII is in 
general recommended for obtaining random values z,, of the first 
interval §. 

The process of obtaining values z,, fortheintervals §, for i>1 
may, however, be simplified if the following fact is taken into 
account. The intervals §, for i> 1ofanErlangian stream of order 
m are the sums of m independent components £,, each of which 
has an exponential distribution with parameter }*. 

Thus, the intervals z,; for an Erlangian stream can be obtained 
by adding m sequences of values of £,. 

We observe that the generation of samples of an Erlangian 
stream by means of electronic digital machines proves rather 
cumbersome. 

Below, we consider some streams which depend ontwo or three 
parameters and have distinct advantages over Erlangian streams 
from the point of view of thecaseof generating samples by means 
of electronic digital machines. 

First generalization of the Erlangian stream. Let us consider 
an ordinary recurrent stream for which the intervals §, between 
calls for ¢>1 are the sums of two random quantities subject to an 
exponential distribution law with parameters i, and ). 
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It can be shown that the probability density function f(z) for 
such a stream will have the form 


Aye 
f= hee = ; (4.25) 
The stream density is expressed by the relation 
— Aide 
h= La (4.26) 


The probability density function for the first interval has the 
form 


(ye—h#1 — dye-#%1), (4.27) 


Sy (2) = ra * 


In order to obtain possible values 2, of the intervals &, for 
i> 1, the procedure may beusedof adding up sequences of random 
numbers with exponential distributions reduced to the correspond- 
ing values of i. 

In order to obtain random values z,, ofthe first interval =, we 
‘must substitute (4.27) in (4.16). From this we obtain the relation 


#? # 
ry ey A al a a (4.28) 
2 1 2 1 
which can be solved with respect to z,, by a method of successive 
approximations. 

If the intervals between calls §, for i>1, are the sums of 
three components having exponential distribution with parame- 
ters ),, A, and A, respectively, we shall have, instead of relations 
(4.25), (4.26) and (4.27), the following ones: 


e7 ht e7* 
F@= Mos [weir oY) as 7 i ) “ =a 
e~ ; 
apy iF (4.29) 
dj ded 
=a tu” (4.30) 
i Aydehs [ Aghse "4 
A@= Tia, Di Tala a—ioe my + 
dj Age 2 Ade "st 4.31 
(Ay — Ag) (As — Ao) (A, — As) (Ag — Ag) | ‘ ( ) 


Second generalization of an Erlangian stream. Letus consider 
a recurrent ordinary stream for whichtheintervals §,for i> 1 are 
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the sums of two independent random variables having uniform dis- 
tribution over the intervals (0, 6,;) and (0, 4), respectively. 

If 6,< 4, the probability density function f(z) for intervals 
& (> 1) has the form 


2 
[5- O<z<b), 
ro} 2 (<z<h), (4.32) 
aaeee (, <2 <b, +5,). | 


The density of such a stream has the form 


2 


i= 755," 


(4.33) 


The probability density function for the first interval has the 
form 


26, b,— z? 


GhaG, Fey OS FSP 
2 (bg —2) +b; 
fi(Z)=q 92 (1 + be) (1 <2 <b): (4.34) 
zo (z—b,)_— by — by 
b,bz (6; + 49) bib, bby 
(b, <2 <j + by). 


The procedure for obtaining possible values of z,; of the 
intervals =; for i>1 reduces to adding random numbers having 
uniform distribution over the intervals (0, b,) and (0, 5,), respec- 
tively. 

To generate values of the first interval § we can uSe the 
approximate methods considered in Chapter VII. 

Stream with a fixed minimum interval. To solve applied 
problems we have sometimes to consider streams for which the 
intervals =, for i>1 are the sums of a constant a and a random 
quantity §. 

For example, if § has an exponential distribution with parame- 
ter 2*, the probability density function /(z) for the interval:, for 
i> 1 has the form 


for O<z<a 
f (2) =| )*e-* (z-e) for z = a. (4.35) 
The density of such a stream is equal to 
ja (4.36) 


1+ ar* * 
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The probability density function for the first interval has the 
form 


* 
Tear * OSs ee 4,37 
Pe (4.37) 
Taare 1-4) for 2, >a. 

The procedure for generating values z,, for i>1 for sucha 
stream is evident: we need to add up the constant a and possible 
values x,, of the random quantity = having an exponential distribu- 
tion with parameter }*. The quantity \* can be evaluated from the 
relation (4.36) if the required stream density 4 is assigned. 

In order to obtainsamples z,, ofthefirst interval ¢, with prob- 
ability density function (4.37) we can proceed in the following 
manner: find the probabilities p(0, a) andp(a, co)for the value of 
& to be found in the respective intervals and apply relation (4.16) 
to (4.37). The following expressions are obtained for the proba- 
bilitiesp(0, a) and p(a, 9): 


pO. =e (4.38) 
P(e, ©) = Ta (4.39) 
By using (4.16) we find 
x fdu=r OO<R,< pO, a)), (4.40) 
T+ ai. ; ae 
0 


and 


pO, pas fewu-rdu=R, (pQ, a) <R, <1). (4.41) 


a 


By solving Eqs. (4.40) and (4.41) with respect to z, we obtain 
2y=(a+)R, (4.42) 

and 
2y=a——, Inf{1—( +a) 18, — pO, al}. (4.43) 


The procedure for generating values of z, consists in the 
following. From a set of random numbers with uniform distribution 
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over the interval (0,1) weselecta random number R,; and compare 
it with p(0, a). If 


Ri < PO. a), (4.44) 


we evaluate 2, according to (4.42). If the inequality (4.44) is not 
satisfied, then to evaluate z,, we use (4.43). 

Stream with variable parameter. We shall consider one of the 
possible methods of generating samples fora Poissonstream with 
variable parameter (4.12). 

To obtain possible values z, of the intervals between calls we 
use relation (4.16). Let us apply it to the probability density 
function (4.15), Then 


A(t, 2)=—In(l1—R), (4.45) 


where R, is a random number having a uniform distribution over 
the interval (0, 1). 

It can easily be shown, by using the probability density function 
(4.14), that relation (4.45) is valid not only for z, for i>1 but 
also in the case of the first interval §, if we put ()=0. 

By solving equation (4.45) with respect to z, the required 
sequence of quantities 2, can be obtained. 

We shall discuss in some detail only a very simple example. 
Let the instantaneous stream density \(¢) be linearly dependent on 
time 


h(@)= at-Lo. (4.46) 
Then, by virtue of (4.13), 
A(t, 2) = 4+-2(at-+5), (4.47) 


and expression (4.45) assumes the form 


2 (at-+6)=—In(1—R). (4.48) 
By taking t= 0 for the first interval §,, we obtain 


#1 b= —in(1—R) (4.49) 
or 


St Ren Gh (4.50) 


41> a 
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For the second interval § we put ¢=z,. Then 
az; 
=z + (az, + 6) = —In (1 — Rj) (4.51) 


or 


ges — (az, + 6) + V (az, + 6)? — 2ain (1 — Rj) 
»' Sacaass a SSS 
i-1 
For the interval of serial number i we put t= Dey Then 
=] 


i-1 fm. |... 
—(«3 ato) 4 V (Sats) — 2aIn(i— Ry) 
2.>= at (4.52) 

In a similar manner we can obtaina z, sequence also for other 
expressions for \(¢) different from (4.46). 

Note on nonordinary streams. If the number of calls arriving 
at an instant of time ¢; is a random quantity independent of /,;, the 
following procedure may be recommended for obtaining possible 
values of the number of calls k. Let the probability that & calls 


arrive at the instant ¢, be equal top,; >i p,= 1. Then the 
k 


quantity & must be selected by lot according to the probabilities p,. 
To do this, we select the random number k; having uniform dis- 
tribution over the interval (0, 1) and compare it with successive 
sums of p,. 

The number of calls is equal to & in the case when 


> 
| 


1 


& 
P< Ry < DB Py (4.53) 


LM 


t 


where p)=0. 

Note on streams of a more general nature. If calls are char- 
acterized not only by their instant of arrival f; in the system but 
also by the parameters a!), a2), ..., a’, then the generation of 
samples of the stream proves more complicated. 

Let the stream of instants ¢, be a stream with limited after- 
effect, and let the parameters a‘, «!#/, ..., al™ be random quantities 
independent of t;. We shall assume that some of the quantities a”) 
are fixed, others are independent of the remaining ones, and some 
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are mutually dependent. With this assumption, we must assign 
distribution laws for the independent parameters 2\") andjoint dis- 
tribution laws for the dependent ones. 

The generation of samples of a stream of such a type consists 
in the following. A stream of instants ¢,is generated, as a stream 
of homogeneous events with limited after-effect, according to the 
rules considered above. For each call arriving at an instant of 
time /,, we choose random values ofthe parameters a(!), a), ..., alm 
by using the methods for obtaining random numbers and multi- 
dimensional random vectors with assigned distribution laws con- 
sidered in Chapter VII. 


5. STRUCTURE OF AN ALGORITHM FOR SOLVING 
MASS-SERVICE PROBLEMS BY THE 
MONTE CARLO METHOD 


The solution of mass~service problems by the Monte Carlo 
method in the form considered amounts essentially to simulating, 
by means of general-purpose digital computers, the process of 
serving calls arriving at the system. 

General rules for the construction of simulating algorithms are 
not yet available. There are, however, various methods that enable 
us to represent a formalized service process in the form ofa 
sequence of operations (or group of operations) accomplished by 
machine, 

The structure of an algorithm to simulate the functioning of a 
typical mass-service system will be considered here by means of 
a concrete example, 

Suppose the mass-service system consists of x lines at which 
calls arrive at random instants of time ¢;. The calls form an 
ordinary recurrent stream of homogeneous events. [If at the 
instant tf, there are free lines, a call is admitted to service and 
occupies only one of the lines for a time t. A call that finds all 
lines busy remains in the systemforatime +). If during this time 
none of the lines starts to serve the call, the call is rejected. 

As lines become free, and in that order, they are occupied by 
calls arriving at the system; a given linecannot be occupied while 
there remain free lines that have become free earlier. 

If at the instant of a line becoming free there are calls in the 
queue, the call admitted to service is the one that would be first 
rejected if it were not admitted to service. 

Figure 27 shows the logical block diagram of an algorithm 
for simulating the process of serving calls of the system described 
above. Each operator is, as a rule, a subalgorithm which imple- 
ments in the simulating process a determinate function of the 
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Fig. 27. Block diagram of the simulated algorithm. 


in cells for tj 


system. Logical operators are shown with circles. If a condition 
tested by a given logical operator is fulfilled, the arrow denoting 
the direction of control is qualified by the index 1, in the opposite 
ease by the index 0. 

The sequence of operations in the realization of such an 
algorithm may be described as follows. 

The operator 1 generates the instants ft; of arrival of calls in 
the server system. Methods of generating streams of calls have 
been considered in the previous section. 

Suppose the operator 1 has generated the next value in turn, 
t,. Control is transferred to the operator 2. By comparing with 
zero the value & generated by the operators 11 and 13 for the 
number of calls in the queue, the operator 2 produces an index 
equal to 1 if there is a queue, and equal to zero if there is no 
queue of calls. 
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Let us first consider the branch ofthe algorithm corresponding 
to the absence of a queue. In this case, control is transferred to 
the operator 3, which compares the quantity ¢, with the constant 
T, namely, the duration of the interval of time during which the 
service process is to be investigated. If ¢,< 7, thecall belongs to 
the portion of the stream thatistobe investigated. In the opposite 
case the call is not considered. 

Let t;< T. Then control is transferred to the operator 4, 
which compares the instants at which the lines are freed 
(these are generated by the operator 8) and determines the 
smallest of them. Control is transferred to the operator 5, which 
establishes whether ¢, is smaller than the minimum value t,,.,.. If 
it is, this means that at the instant of the call all lines are found 
busy; in the opposite case there is at least one free line. 

Let us first consider the case when a free line is available, 
Control is transferred to the operator 6, which selects, according 
to the assigned rule, a line for serving the given call. Then, a 
random value is generated for the servicetime +, (the operator 7) 
as a random number with given distribution law; the time for the 
line to become free is calculated (to the instant ¢,, which in this 
case is the instant of start of service, there is added 4; the 
operator 8), and the algorithm is made ready to simulate the 
service process of the next call (the operator 9), 

The passage to the next call can be accomplished in two ways. 
In the first case we pass from the operator 10 along the arrow with 
index 0 to the operator 1 if the instant of arrival of the next call 
has not been generated. The second case occurs when the instant 
of arrival of the next callhasalready been generated and recorded: 
either during the time between subsequent instants of lines becom- 
ing free a number of calls have arrived (t; is recorded by the 
operator 14) or thenextcallhasa ¢, that is larger than the instant 
at which the given line became free and therefore it is not con- 
sidered as a claimant to service by the given line (¢] is recorded 
by the operator 25). If it is found that there are already values 
of ¢} or ¢, (this fact is verified by the operator 10), then control is 
transferred from the operator 10 along the arrow with index 1 to 
the operator 11. It isthe task ofthe operator 11 to establish the new 
value of the quantity k, namely, the number of calls in the 
queue. 

The need for this arises in connection with the fact that the 
values of ¢; and ¢ are rewritten in cells in which the instants of 
arrival of calls ¢,(the operator 12) are usually stored, Con- 
trol is then transferred to the operator 2. This concludes the 
analysis of the sequence of operators (beginning withthe operator 
2) for the case when at the instant /,; there is no queue, 
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Let us return to the operator 5. Let us assume now that the 
condition tested by the operator 5 is satisfied. This means that at 
the instant of arrival of the call ¢, all lines have been found busy. 
Then the following facts must be taken into account. Although at 
the instant ¢; there was no queue of calls, still during the interval 
of time between ¢, and the instant of a line becoming free several 
more calls may arrive, and then, at the instant of a line becoming 
free, a queue of calls may be present. Therefore we pass from 
the operator 5 along the arrow with index 1 to the operator 13 
which adds unity to the value of & (the number of calls in the 
queue); we record ¢, with the notation ff (the operator 14) and pass 
to the operator 1 for generating the next instant of arrival of a 
eall. 

Since now in the queue there is at least one call, the operation 
of the algorithm can be continued from the operator 2 only in the 
direction of the operator 22. However, before considering the 
chain of operators for the case of presence of a queue of calls, we 
shall return to the operator 3. 

Let the condition t, << T tested by the operator 3 be found to be 
not satisfied. This means that the call arriving at the instant 
t, does not belong to the portion of the stream being investigated 
and therefore is not considered. At the same time, this means 
that the given stream sample has been exhausted and we have to 
pass to the next sample if the given one is not the last. We pass, 
therefore, from the operator3 along the arrow with index 0 to the 
operator 15. If N= N*(theconditiontestedby the operator 15 was 
not satisfied; the given sample provedto be the last), we pass from 
the operator 15 along the arrow with index 0 to the processing 
(the operator 18) and issuing (the operator 19) of the results of 
simulation, 

In the case when N < N*, thework of the algorithm is continued 
(starting from the operator 15) in the direction of the arrow with 
index 1. Here the algorithm is made ready to simulate the process 
of serving a new sample of the stream of calls (the operator 16), 
the number N of samples investigated is recorded (the operator 17), 
and then control is transferred to the operator 1. 

Let us now proceed to consider the branch of the algorithm 
for the case when there is a queue of calls. 

If k>0O (the operator 2), we pass to the operator 22. Suppose 
that the condition tested by the operator 22 is satisfied. Then 
there remains to compare (the operator 24) ¢, with the smallest 
time for a line to become free, which time is generated by the 
operator 23. If ¢, is smaller than the time for a line to become 
free, it can be found that several more calls have the same 
property, i.e., arrive earlier than a linebecomesfree. Therefore 
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we pass from the operator 24 in the direction of the arrow with 
the index 1 to the subchain of operators considered earlier (13, 14 
and 1). The work of the algorithm will continue cyclically (the 
operators 1, 2, 22, 23, 24, 13, 14,1, etc.) until the next ¢, is found 
larger than the time for a line to be free, In this case the buildup 
of the queue is interrupted and the last value of t, is recorded 
with the notation ¢, by the operator 25, We observe that a call that 
has arrived at the instant ¢, (after the given line has become free) 
is not a claimant to service by the given line. Thus, at the instant 
of a line becoming free there is a certain number &’ of calls in 
the queue which are claimants to service by the given line. The 
instants 4 of their arrival in the system are recorded by the 
operator 14. 

The operator 26 generates, forallcalls arriving at the instants 
t?, random values of the permanence time inthe system Ther 1 a8 
random numbers with assigned distribution law. 

By adding the values of t per » to ¢7 (the operator 27) we obtain 
the instants of time ¢,; at which calls can be rejected if they have 
not been admitted to service. 

The operator 28 compares the value of &* with unity. If at the 
instant of a line becoming free there were found several calls in 
the queue (&* > 1), we need to determine the smallest value t,,; (the 
operator 29), since the line that becomes free must proceed to 
serve calls with the least value of ¢.. If k*=1, control is trans- 
ferred directly to the operator 30. The operator 30 compares the 
value ¢;; with the time ¢ fe. of a line becoming free. The perma- 
nence time in the system for callsthat satisfy the condition tested 
by the operator 30 elapses before a linebecomes free. Therefore 
such calls will not be served (are rejected). 

The number of rejected calls (losses) is recordedby the oper- 
ator 31. Thenthe rejectedcallis eliminated (the operator 21) from 
the number of claimants to service, and control is transferred to 
the operator 20. The operation of the algorithm proceeds cyclically 
(the operators 30, 31, 21, 20, 28, 29, 30, etc.) until either all 
rejected calls are eliminated from consideration andthe condition 
tested by the operator 30 isfoundnotto be satisfied by the follow- 
ing call, or else k* becomes equal to zero. 

Let t.=ttree- Then the possibility of serving the call exists. 
In this case control is transferred from the operator 30 in the 
direction of the arrow with index 0 tothe operator 6 and operation 
of the algorithm proceeds along the chain considered earlier. 

It may be found, onthe other hand, that k*=0 (the operator 20). 
This means that allcalls that areclaimantsto service by the given 
line are rejected. Then along the arrow with index 0 control is 
transferred to the operator 9 for passing to the next call. 
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We observe that if the condition tested by the operator 22 is not 
satisfied (i.e., the call arriving at the instant /, is eliminated 
from consideration) then, along the arrow withindex0, we pass to 
the processing of the calls that are in the queue (the operator 26 
and following ones). 

Algorithms for simulating call-service processes can be de- 
signed in a similar manner also for other types of mass-service 
systems. 

The structure of a simulating algorithm for the case when calls 
are served in their queueing order and stoppages of the apparatus 
occur owing to its inadequate reliability is considered in [11]. 

In concluding, we observe that the same design principles can 
be used for the simulating algorithm for the service processes of 
calls with substantial inhomogeneity, when each callis character- 
ized not only by ¢, butalsobya series of parameters «,, a), «++, Se 
Typical features of these algorithms are usually the presence 
of operators that find the values of t,, 0, Tree, ts, etc., aS a func- 
tion of the parameters ,, a,,...,a,,a5 well as more complicated 
rules for determining the order of serving calls and the order of 
succession of the lines used. 

Algorithms to simulate call-service processes are imple- 
mented fairly efficiently by modern general-purpose digital 
computers. 


6. CONSIDERATIONS ON THE PROCESSING 
OF THE RESULTS OF SIMULATION 


The Monte Carlo method enables us to carry out sufficiently 
thorough investigations of mass-service processes of calls of a 
given stream. In formulating a problem in the study of mass- 
service processes, we must specify the quantities to be used as 
service-performance indices, since the structure of the simulating 
algorithm and the method of solution of problems often depend on 
our choice of the quantities to be evaluated. 

For example, for systems with losses, parameters of the dis- 
tribution law of the fraction of losses are usually employed as 
service-performance indices. The simplest and most useful index 
of such a kind is the mean value of the fraction of losses R(t, f) 
over intervals of time (¢,, ¢;-+¢).We shall discuss briefly the mean- 
ing of this quantity. Letus consider a set of samples of the service 
process over an interval of time (¢,, 4,+4. The number of calls 
arriving in the system during this interval of time for a sample 
chosen at random is a random quantity. 
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Let the mean value of this quantity be N(f,, ¢). We shall denote 
by m(f,. ) the mean number of losses during this same interval of 
time. Then 


— (to t) 
Rly D= Fp. (4.54) 

In the case of a stationary stream of calls the quantity N(t). 4 
does not depend on ¢, and can be written in the form 


N=it, (4.55) 


where } is the density of the stream of calls. 

For mass-service systems with constant parameters and for 
instants of time sufficiently far removed from the beginning of 
service, the quantity m(¢), ‘) will also be independent of ¢, and can 
be written as 

m= hej (4.56) 
where \;,j is the density of the stream of lost calls. 

Then the mean fraction of losses A is a constant quantity 


a (4.57) 


We observed that in the case when relation (4.57) is valid, the 
quantity R can also be interpreted as the probability of loss for a 
call arriving in the system at an arbitrary instant of time. 

The service-performance indices usually considered for delay 
systems are the mean value of the waiting time or the mean value 
of queue length. For mixed mass-service systems both groups of 
quantities are usually employed. 

Estimates for the service-performance indices canbe obtained 
from results of repeated simulation. 

Let us consider, for example, the estimation of parameters 
connected with the distribution law of the number of losses. We 
shall assume that in the simulating process the number of investi- 
gated samples N is counted and that, in addition, the number of 
arriving calls n, and the number of lost calls are recorded for 
each sample. 

The last information can be used to calculate the number of 
samples at), m,, m),..., m, forwhich0,1,2,...,% losses have been 
recorded respectively. It is evident that we cantake as estimates 
for the probabilities p,, which assign the distribution law of the 
number of losses, the quantities 


y= (4.58) 
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The service-performance indices usually adopted can be ex- 
pressed in terms of these quantities. Thus, for example, the 
fraction of stream samples for whichthere has been found at least 
one loss has the form 


p1—=3. (4.59) 


The mean value of the number of losses occurring in a single 
sample can be evaluated at 


mos Sit, (4.60) 
vf 


where the summation is carried out over all values of /, 
The estimated mean number of calls arriving in the system 
during one sample will be 


N 
nm S) ny. (4.61) 


R=, (4.62) 


When the only service-performance index of interest is the 
mean fraction of losses, the procedure of recording and processing 
of the results of simulation may be simplified. In fact, we shall 
record only the number of calls n* arriving in the system during 
all VN samples of the stream, and the number m* of those of them 
that are lost. Then 


rey (4.63) 
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Chapter V 


Application of the Monte Carlo Method 
to Information Theory 


The development of communications, telemetry, radar and 
other branches of engineering has raised the need for a theoretical 
investigation of statistical phenomena connected with the reception 
and deciphering of transmitted information. As a result, a new 
branch of science has arisen, variously referredtoas ‘‘statistical 
communication theory,’’ ‘‘theory of random signals and noise,’’ 
ete. We shall refer to it as information theory. 

Problems of informationtheory form one of the important fields 
of application of the Monte Carlo method. We have selected for 
discussion in this chapter typical problems of information theory 
that require the use of this method. No systematic account of in- 
formation theory will be given here. 

Attention has been devoted mainly to the analytical aspect of 
the problem, since a simple examination of the expressions (for 
example, integrals) which we encountered in this approach con- 
vinces us that classical methods are unsuitable and suggests 
the use of the Monte Carlo method. 

The greater part of this chapter is devotedto evaluating multi- 
ple integrals encountered in an important branch of information 
theory, namely detection theory. 

In the last section of the chapter we consider methods of 
simulating certain information transmission processes that do not 
lend themselves to investigation by known analytical methods. 


1. STATISTICAL PROPERTIES OF SIGNALS AND NOISE 


Communication systems of all types can be reduced to the 
single typical diagram shown in Fig. 28. The transmission of 
useful information* is accomplished in engineering by means of 


*By this term we mean not only explicit messages but also any information of 
interest to the recipient, for example, the very fact of the presence of a signal, 
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electromagnetic, acoustical, and other means. Whatever the 
chosen physical carriers of information, at the transmitting end 
of the communication line there are generated suitable oscillations 
(electromagnetic, acoustical, etc.) which are transmitted along 
the communication channel.* The types of oscillations used are 
very varied, from the simplest monochromatic oscillations to 
complex ones having arbitrarily wide and complicated spectra. 


Noise source 


Source Transmitter Received Receiver Recipient 
signal 


Signa! 


Message Communications Message 
channel 


Fig. 28. Diagram of communication system. 


As it passes through the communication channel the useful 
signal is distorted by noise, while as itpasses through the receiver 
the fluctuation noise of the amplifier is superimposed on it. The 
information contained in the signal received is assessed at the 
output of the receiver by measuring the values of one or several 
physical quantities. 

The simplest and most often used method of signal transmission 
is the excitation of monochromatic oscillations, in whichcase the 
quantity transmitted s depends on time in the following manner: 


s(t)= Acos(At-+ 9), (5.1) 


where A is the signal amplitude, 4 is the frequency and ¢ is the 
initial phase. The transmission of such a signal enables the recipi- 
ent of the message to measure, inthebest case, three parameters. 
Therefore, in order to transmit various messages, oneor several 
of these parameters have to be varied with time (the signal must 
be ‘‘modulated’’). By observing the behavior of this process, the 
recipient must be able to extract the message transmitted. We 
shall consider only the method of discrete modulation, in which 
the information transmitted by the signalis encodedin the form of 
successive packets of sinusoidal oscillations differing from each 
other in any given parameter, for example, inamplitude. Each of 
such packets will be termed asignalelement, and the signal itself 


*The communication channel can be any medium in which oscillations carrying 
the useful information are propagated. 
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will be considered as a time sequence of such elements. Physical 
elements are interpreted as variations of electrical voltage. Nu- 
merical values can be assigned to a sinusoidal pulse by describing 
any or all of four variables: duration, amplitude, frequency and 
initial phase. Below, the pulse duration will not be taken into 
account since its réle in many problems is a minor one. 

A monochromatic oscillation can also be represented in the 
following modified form of (5.1) 


s()=xcosM-- ysind, L<t<cth, 


where «= Acosg and y= Asing are the projections of the function 
s(t) on the orthogonal functions cos and sindt. Just as the ampli- 
tude A and the phase 9, the quantities x and y determine uniquely 
the form of the oscillation. When two oscillations of one and the 
same frequency are combined the corresponding xandy projections . 
add up algebraically; when (5.1) is multiplied by a number the 
projections are also multiplied by this number. It follows from 
this that the projections x and y can be considered as the compo- 
nents of a vector describing the given oscillation.* Thus each 
signal element is a two-dimensional vector while a signal with n 
elements is described by a set of 2 two-dimensional vectors. 

Let us denote the signal transmitted by S, its elements by s,, 
So, «.+» S,, and the cosine and sine projections of the kth element 
by s, and s; respectively. Then the signal S can be written either 
as a vector 


S=(Sy. Sqr coves Sp)s 


or as a rectangular matrix 


ke af f 
sy Sy eee Si 
s= oo” 8” . 
Ss; Sy ae - 


Wide use is made in engineering of more complicated processes. 
The most important among these are the following: amplitude- 
modulation processes, in which the amplitude is afunction of time 


*An equivalent description uses the complex notation 
$ (tf) =ue™, 


where u = Ae!’ is the complex amplitude. The real oscillatory process is charac- 
terized by the real part of the function s (#). * 
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s(f)= A(t)cos(Mt-++-¢), 


phase-modulation processes, in which the phase is a function of 
time 


s(t)= Acos[M-+ o(4], 
frequency-modulation processes 
s(t)= Acos[A(Ht+- og], 


as well asother processes obtained by combining the ones enumer- 
ated above, Their spectra consist no longer ofa single frequency, 
but of a band of frequencies centered about the ‘‘carrier’’ fre- 
quency A, The interpretation of the voltages of such signals as two- 
dimensional vectors proves rather difficult, but inthe very common 
case of narrow-band processes, in which the modulated quantity, 
whether the amplitude A(‘) or the phase g(t), varies much more 
slowly than the oscillating function cos\t, such aninterpretation is 
possible, 

The information transmitted is contained inthe characteristics 
of the signal voltages: either inthe amplitude, or in the phase, or 
in both. The variation ofthese characteristics from signal element 
to signal element can be accomplished by various methods depend- 
ing on the chosen method of exciting oscillations and on the infor- 
mation content. We need only point out that, in all methods, the 
variation of the physical characteristics is controlled by suitably 
assigning the values of parameters a, 8, 7, ..., which regulate the 
voltage characteristics; their concrete choice is determined by 
the content of the information transmitted. Therefore, a detailed 
notation for the signal must have the form 


(5.2) 


s(@, 8, 7, w=(5 (Be Yo vos) vee 84, Bs T, -) 


HG. Be qe 00s) cee 82GB Fe ean)} 


The various types of signals will be divided into two classes: 
deterministic and random, We shall assign to the first class sig- 
nals in which the dependence of the elements on the parameters 
has a determinate functional form, the parameters themselves 
assuming fixed values. The second class consists of signals that 
depend stochastically on the parameters. This denotes either a 
random relation between elements and parameters, or randomness 
of the values of the parameters, or, finally, both together. 

Both independently fluctuating and correlated random signals 
are considered. The presence of a constant component varying in 
the general case from element to element is admitted. 
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Of basic importance are the probability distributions of the 
combination of signal and noise, as well as those of noise alone. 
Among the various ways in which signal and noise combine, the 
one most often used is the additive combination of signal and 
noise. If by N we denote noise, expressed either by the vector 


N= (My, Mg oes My) 


with elements 7, m,...,,, or by the rectangular matrix 
We ho cae 2 

-( “a ). (5.3) 
1. Ny eee n, 


where 2, Mj,..., ”, m are the components of the vectors of the 
‘‘noise’’ voltages, we have at the output of the receiver the resultant 
voltage ZX, equaltothe sum ofthe corresponding elements of signal 
and noise 


Z=—S4HN 
or 


X=(s,+ my Sy fgs eeee Sy y)s 


which in matrix form is represented thus 


(5.4) 


, , , , , , 
a Sim, sya... sian, 
Sitamy sytany... 8%" 


It is assumed below that signal and noise are statistically inde- 
pendent. Expression (5.4) is conveniently written in a different 
form by introducing the more compact notation: x,,_, = s,-+ m,and 
Xo, = 5,-+m,, as well as Z,=s,-+",=(X__1, ¥2,). Thus the result- 
ant signal will be represented in one of the forms: 


== (2, 2o, wens Zp)s 
re i Hy Xg vee oo 
hq Ng Mg oe Bop 
Sometimes we shall represent the signal © as a 22-dimensiona 
vector 


K = (Ky, Hq, Hqy 000s Haq): (5.5, 


1, STATISTICAL PROPERTIES OF SIGNALS AND NOISE 161 


Even if the signal is nonrandom, the recipient of the message 
has to decide whether the useful signalis present or not in the in- 
put voltage of the receiver. In the case of a positive answer, the 
signal parameters have to be measured as accurately as possible. 
This problem proves particularly difficult in the presence of a high 
level of noise masking the useful signal. Therefore, knowledge of 
the statistical properties of noise plays a particularly important 
role. Numerous theoretical and experimental investigations lead 
to the conclusion that the components of noise-voltage vectors are 
distributed, as a rule, according to the normal law. We shall 
distinguish two cases: correlated noise, when the noise elements 
are stochastically connected with each other, and uncorrelated 
noise. It is assumed, below, that the expectations of the noise 
components are equal to zero, i.e., that noise contains no constant 
components. 

It is well known that the sum of multidimensional normal quan- 
tities is also a normal quantity, and that its covariance matrix 
is equal to the sum of the corresponding matrices of the com- 
ponent quantities if these are independent. Thus, when mutually 
independent signal and noise are combined additively we have for 
the 2n-dimensional probability density function the expression 


1 - 5 (X-A) Mo} (X-A) 
’ 


Psn(Xy +++ *2) = Va (5.6) 


where the following notation has been introduced: Di= M,+ My 
is the covariance matrix of the sum of signal and noise (M, 
and Yt, are the matrices of signal and noise respectively); D 
is the determinant of the matrix 2; XI7'X is the quadratic form 
corresponding to the 2”-dimensional vector X; A= (a), a, ..-» Qa) 
is a vector equalto the expectation of the vector X. The probability 
density of noise is represented in this notation by the formula 


1 ae XNy 


PN(Xy 0 e-s *1d) = TaD 2 
. N 


1 
oa (5.7) 


If the signal is nonrandom, the matrix St is the covariance 
matrix of noise only, and the vector A represents the 
components of the voltages of the useful signal. Therefore, the 
probability density (5.6) characterizes all types of signals and 
noise considered. In the particular case when the components of 
the output voltage are independent we have 
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2n 
1 (¥p7 2p)? 
1 —- % 
Poy Ay +++: A= Tape, * : 


The probability density (5.6) gives an exhaustive description of 
the statistical properties of the output voltage and depends on all 
parameters of the useful signal. The mathematical investigation of 
detection capabilities and performance consists in the application 
of operations of various kinds tothe normal distribution (5.6). For 
some widely used signals, however, the probability density (5.6) 
must, as a preliminary, be transformed to another functional 
form. This is explained by the fact that useful information is often 
transmitted not by means of all the characteristics of avoltage 
(amplitude, phase, frequency), but only by certainofthem or even 
only one of them, for example, the amplitude. The passage to the 
amplitudes means analytically a change of the variables in (5.6), 
and is precisely the passage from Cartesian to polar coordinates 
with consequent averaging with respect to the polar angles (in 
radio-engineering language, with respect to the phases). As a 
result of such a transformation, the functional form of the proba- 
bility density for many types of signal and noise turns out to be 
extremely complicated, and problems that are solved in closed 
form for normally distributed signals become in these cases un- 
tractable, 

Let us enumerate now the modifications of the normal proba- 
bility density which we shall deal with in the following exposition. 

We shall begin with the simple case in which the successive 
elements of the message transmitted, which are formedby useful 
signal and additive noise, are statistically independent, and the 
components of each of the elements are also independent and have 
equal dispersions. The corresponding probability density has the 
form 


- = *3n-1t oR 8) 
sme 1 20% (5. 
Py ien “d= T[oz° ‘ 
By passing to the polar coordinates 
Nop) == i 
2a-1= 1a C08 beats. hy (5.9) 
X24 Tp SIN Dy, 
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we obtain 


[ m 
2 
P(ry3 G13 Te Fah ood Tal 5 IL “mC e “k for r, > 0, 
Rk=1 Tc, 
| 0, if at least one r, is negative. 


By averaging with respect to the phases 9. ..., 9, over intervals 
(0, 2x) we arrive at the joint probability density of the amplitudes 
of the signal 


Bei. ae 
[4 20% for r,> 0, 
PUTyr oe erly) = RET SR k=1,..., 
0, if at least one r, 
is negative (5.10) 


which is the product of the so-called Rayleigh unidimensional proba- 
bility densities. This expression will be termed the distribution 
density of noiselike signal, The recipient of a signal with such a 
distribution assesses the information content from the amplitudes 
of the voltages received, the values ofwhichare characterized by 
the values of the parameters <j, equal to the sum of the disper- 
sions of the individual signal and noise voltages. 

Somewhat more complicated is the expression for the proba- 
bility density of the amplitudes of the voltages if the signal is 
rigorously deterministic. The components of the vectors of the 
signal voltages will be written, for the sake of simplicity, in the 
form 


(Sy O), (Sq, 0), «+ +s (Sq» 0). 


Then the joint probability density of the output voltages of the 
receiver will be equal to 


n _ e217 54) td 
2 


1 20’ 
P(X eae x)=I[] ze . 


par 28S, 


Again, by passing to polar coordinates according to (5.9) and 
averaging with respect to the phases, we obtain the distribution 
density of the amplitudes 
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n =- 


fe 7, ( Tek \ ¢ % for r,>0, 
ee IT 2 o( at) a> 


k=1 °p 


0, if at least one r, is negative. 
(5.11) 


which proves to be the product of the so-called Rice probability 
distribution densities.* The derivation of (5.11) takes into account 
the fact that the signal is notrandom, This formula retains, how- 
ever, its validity also in the case of a random signal having non- 
zero expectation. The value of each amplitude is characterized by 
two parameters s, and «,, one of which, s,, is only related to the 
signal, while the other one, o?, is the sum of the dispersions of 
useful signal and noise. The presence of a constant component in 
the signal (which is expressed by the fact that the expectations of 
the components of the vector of the signalvoltage are not equal to 
zero) introduces complication inthe formof the probability density 
of the amplitudes. Its analytical expression contains Bessel func- 
tions. Even more substantial complications arise in the case of 
statistical dependence between the signal elements. 

Let us consider an example of signal with correlation between 
the elements of the input voltage. Letthe signal elements have the 
form 


S=(Ajs, AgS, «+, A,5)s 


where A;, Ay, ..., A, are second-order matrices while s is a two- 
dimensional random vector with independent components. The 


a, 5, 
Cy a, 


transformations A,= are assumed to transform vectors 


with independent components into similar vectors. Itcan easily be 
verified that, for this tobeso, the elements of the matrix A, must 
satisfy the relations 


*The function /, (x) is the modified Bessel function of the first kind and zero order 
and is defined by the relation Jo (x)= J, (ix). It is represented by the power series 


co 
I) (x) = > “aaa , Its asymptotic behavior at infinity is described by the formula 
n=0 


Io(x) = = [} +0(—)]. 
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AyCp+5,d,=0, R=1, 2, ws, B 
Signals of such a form are met insome radar problems, in which 
A,= h,U*, 


where A, is a sequence of positive numbers and U is an orthogonal 
matrix. The resultant output signal is formed by the combination 
of the signal S with noise, the elements of which are independent. 
In order to evaluate the joint probability density of the amplitudes 
of the resultant voltage we shall fix the value of the signal-vector 
S. Then, owing to the independence of noise, the amplitudes of the 
resultant signal are found to be independent and each of them is 
distributed according to Rice’s law. Thus the joint conditional 
probability density of the amplitudes, with the condition that the 
amplitude of the useful signal have values s, is found to be equal to 


2 2 
beds 


2 
r 2 
P(fyy o-0s Ta l|S)= te “k to (“te s)) 
G 
2 


k=l [% 


|| Ags I] 
sil 
be obtained by averaging this expression over all possible values 
of s, The modulus of the signal-voltage vectors is distributed, as 

usual, according to the Rayleigh law 


where a,= The unconditional joint probability density will 
¢ om 
P(s)= Gre 48, s>0. 


On some reduction we obtain the following final expression of the 
required probability density 


Psn (Fir veer T= iss 
r 3b 1 a 
n rp “Hf _ ‘(zt = 3) Tl 1 jay 
sol hg (F be 
for ote vses Fg > O, (5,12) 
0 for other ry, ..-5 Tq» 
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which depends on 2n-+-1 parameters a;, a), ..., @,, of, 93+..., 0%, 9% 
The form of this probability density, which is represented by an 
integral not expressible in terms of elementary functions (except 
for the cases n=1, 2, which are of little interest), shows that 
calculations connected with a probability density of sucha kind do 
not lend themselves to the usual numerical methods of solution. 
We observe that in the absence of signal the joint probability 
density of noise is given by expression (5.10), and in the presence 
of signal by expression (5.12), in which there occurs the integral 
of a combination of exponential and Bessel functions. 

We observe that the probability distribution of amplitudes (5.12) 
arises also in problems connected with the passage of noise through 
multichannel systems, in which the frequency bands of the indi- 
vidual channels overlap each other thus causing stochastic con- 
nection between the noise elements. 

Let us now consider a resultant signal with arbitrary correla- 
tion between the components of the voltage vectors. We shall as- 
sume, for the sake of simplicity, that the expectations of the 
components are equal to zero, and therefore the 2n-dimensional 
probability density of the components has the form 


-ixm-'x 


1 
P(Y)= aoyp ° , (5.13) 


In the problems of detection theory considered below the co- 
variance matrix t= ||m,,|, where m,,—Mx,x,; is the product- 
moment of the components x, and x,, is characterized by the 
following relations between the matrix elements: 


M1, Mj 
Mp -1,.91-1 = Mop, op (5.14) 


Mop— 1.91 = — Mo1_}. 2+ 


The first of these indicates that the covariance matrix M 
is symmetrical. The second relation indicates that thetwo groups 
of components x1, X53, ..., Xp_-1 aNd xy, X%4,..., %_, are correlated in 
the same manner, and therefore the dispersions of the two com- 
ponents of the vector of the resultant voltage are equal 


2__ My2  — Mx? 
& = Mxp,_, = Mx},. 
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The last relation (5.14) indicates that these components are inde- 
pendent (for k=1/) 


MXop_ 1X 9~ = Mop 1,24 = — Mrg—1, 29 = — MX o4_ 1X0: 
i.e., 
M41, 24 = 0. 


Let us find the n-dimensional probability density of the am- 
plitudes of the signals. We observe, to this end, that the ele- 
ments of the inverse matrix #7? = \|@,;|| Satisfy relations of the 
form (5.14). Therefore (5.13) can be rewritten in greater detail 
thus: 


P(Xp- ee Xq,) = 
2a 


1 1 
Ga yp %P\— Dy lens 21-1 X2~¢—1% 91-1 -F Xop*0)) 


H+ Go q—1, 24 (X2~—1% 21 — Xo1-1%08)] \. 


We pass in this expression to polar coordinates, by putting* 
Oop 1» 9y-1 == Oy COS Oy, Ooy—1, 07 = Og, Sin 8,,. Then the quadratic form in 
the exponent is found to be equal to 


n 
o= , x, [49% —1, 21-17 #1 COS (9, — Fm) + Gon—a, oF elt 


n 
x sin (¢,—9,)I = Dearest 2 2, Dy gh 1 COS (9 — Pp — 97). 
By averaging the phase-and-amplitude probability density obtained 


1 F 
Py see Th ea Oe seen yi Pp seer Pq) 


Q2x)"VD 


PCT ys ys Ta» Fai 05 Tae Pn) = 


with respect to the phases, we arrive at an expression for the 
probability density of the amplitudes alone in terms of a multiple 
integral 


*Thus we have 


ber = big, 9%, =— Dy 
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By writing such an integral as a series of repeated integrals and 
using the known formula from Bessel-function theory 


J 
e-seose== Sy (—1)" Lae) em, 
ac—co 


where /,(z) is the modified Bessel function of the first kind and 
mth order, we obtain on reduction the following final expression 
for the required probability of the amplitudes where 


n 
1 2 
-7 2 pele 


1 
DT 4s cess ei sve Tl 


“TT IIe, ergy OniT aD 


RaLISk 


os (S b> vata) (5.15) 


k=1I>k 


where 


(; for »,,=0, 
“~~ 2 for Vpz > OL 


The expression given emerges asa generalization ofthe proba- 
bility density (5.12) for arbitrarily correlated signal and noise 
voltages. It is evident that in both cases the existence of explicit 
analytical expressions for the probability densities does not 
necessarily imply the feasibility of elementary calculations ofthe 
probability characteristics, It is just this fact that makes it nec- 
essary to have recourse in problems of detection theory to the 
Monte Carlo method. 

In concluding this section we shall consider one more type of 
signal which occurs when at the receiver output there is a 
threshold-type device. The- latter’s action is as follows: any 
voltage exceeding the threshold level ¢ gives rise to a standard 
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unit signal while a voltage smaller thanthe level c is not recorded, 
i.e., gives a ‘‘zero’’ signal. Thus, as a result of the passage of 
signal through a threshold-type device the signal may be said to 
be ‘‘quantized,’’ in that it assumes only two values, which can be 
termed the ‘‘one’’ and ‘‘zero’’ values, In this case the signal ob- 
served at the receiver output is a sequence of zeros and ones, 
The probabilistic description of a ‘‘quantized’’ signal is carried 
out by assigning either the probability of the occurrence of one or 
other of the two possible symbols if the successive elements of 
the resultant signal are independent, or the probabilities of com- 
binations of zeros and ones in the case of correlation of the signal 
elements, In the first case we shall denote by p, the probability 
that the amplitude of the /th component exceeds the threshold 
level c, and by g,=1—p, the probability that this component falls 
below the threshold. Since the amplitudes of the voltages are dis- 
tributed according to the Rayleigh law, the probabilities of interest 
to us can be written explicitly thus: 


x ea 


2 Ree 
paPi>onf Se a dx—=e “t, 


The probabilities of various combinations of zeros and ones are 
found without difficulty. The probability that an n-element signal 
consists of ones at the placesi;, i,,..., /,andzeros at the remain- 
ing places is equal to 


a 
Pi,, cd =D, ; pei, Sa 


A more complex picture is obtained in the case of correlated 
elements. In this case the description of the signal is accomplished 
by assigning the joint probability of occurrence of ones and zeros 
at the corresponding places, this probability being denoted by 
Diy, ig, «20s O35 Fy eee Js) Where a+tB—=n, The evaluation of such 
a probability is carried out by integrating the probability density 
of the amplitudes over the corresponding region: 


Play ones ii fv eee y= 


e 
fer, ws fan, fer, pt [Ry Rs rn): 
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2. FORMULATION OF THE BASIC PROBLEMS 
OF DETECTION THEORY 


The problems formulated in this section arise in estimating the 
performance of various methods of extracting a useful signal from 
a noise background. 

' As a transmitted message passes through the communication 
channel and is amplified in the receiver, it is polluted by the 
channel noise and the receiver noise, and in such distorted form 
arrives at the recipient who assesses the message content on the 
basis of the voltage amplitudes. In many cases, for example in 
long-distance communication, the intensity of the signals proves 
to be so small that their very presence becomes uncertain, and 
the first problem the recipient has to solve consists in assessing 
whether useful signal is contained in the observed voltage or 
whether the latter is generated by noise alone. The random nature 
of the amplitudes of the voltages of which the incoming messages 
consist indicates that their complete characteristics will be con- 
tained in distribution laws. These laws willbe different according 
to whether useful signalis present and at the output of the receiver 
there is a mixture of signal and noise, or whether there is only 
noise. It has been indicated inthe previous section that in problems 
of detection theory this difference shows in the values of the 
parameters of the distributions. Thus the initial (2)-dimensional 
normal distribution of the components of the vectors of the voltages 
is assigned by the covariance matrix 9t, which in the pres- 
ence of signal is equal to the sum of the covariance matrices 
of signals 2, and noise WM, respectively. In the absence 
of signal this matrix is equal to It,. Of course, the amplitude 
distributions obtained from these normal distributions also differ 
in the values of the parameters; by assuming that, in the corre- 
sponding probability density p.,,(of the mixture of signal and noise), 
the parameters generated by the elements of the matrix Mt, are 
equal to zero we arrive at the probability density p,, of the distri- 
bution of noise. 

Thus the problem of detecting signals on a noise background 
reduces to testing, on the basis of observed values of the signal 
received, a hypothesis on the values of parameters of its distri- 
bution. Such problems are studied in mathematical statistics (see 
12, 29, and 43). 

We shall consider two methods of solutionofthis problem. One 
of them has been developed by Neumann and Pearson, the other 
by Wald. The initial ideas of thetwo approaches are the same and 
consist in the following. 
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Let & be a random quantity which can be in general multi- 
dimensional E=(E,, &, .... &,) With probability density p(X; a), 
depending on the parameter a.* The functional form of the proba- 
bility density is determined but the values of the parameters are 
unknown, As to the possible values of the parameters two hypoth- 
eses, H, and H,, are permissible, These are of two kinds: 
simple or composite, They are termed simple when the hypoth- 
esis H, consists in assuming «=a, while the hypothesis H, 
consists in assuming a—=a,. The hypotheses are said to be com- 
posite when at least one of them allows the parameter « to assume 
several values. An example of the last situation is the following 
alternative: according to the hypothesis H, we have a0, and 
according to the hypothesis H, we have a> 0. We shall be con- 
cerned, below, with simple hypotheses. 

Which of the two hypotheses is valid is assessed on the basis 
of results of drawing sample values of =, which has given proba- 
bility density p(X; «). A sampling point willbe denoted X=(x,,x...., 
x,,)3 the set of all such possible points belongs to an m-dimen- 
sional sampling space. The statistical methods considered for 
testing hypotheses consists in dividing the sampling space into 
subspaces (in Neumann’s and Pearson’s theory there are two of 
them, and in Wald’s sequential analysis three), such that if the 
sampling point is found in one of them, the hypothesis 4% is re- 
jected (this is called the critical subspace), if it is found in 
another the hypothesis is accepted, and finally (in sequential 
analysis), if the sampling point is found in the third subspace, 
the experiment is continued owing to insufficient data. 

The random nature of the sampling point may lead to the 
following types of wrong decisions as tothe validity of the hypoth- 
eses: we may accept the hypothesis H, although it is H, that is 
valid (error of the first kind), or else we may reject the hypoth- 
esis H, in spite of the fact that it is valid (error of the second 
kind), 

Let us illustrate what has been said in terms of detection 
theory: the recipient of the message, having measured the am- 
plitudes of the voltages at the output of the receiver, must es- 
tablish whether they are caused by the action of noise only (the 
hypothesis H,) or whether they contain useful signal in addition 
to noise (the hypothesis H,). Suppose the signal is actually pres- 
ent; as the recipient of the message hasno previous knowledge of 
this, he may, not having detected the signal, mistakenly reject 


*To simplify the notation, we assume a to be a unidimensional quantity. The ex- 
tension of the theory to the case of several parameters is evident, 
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the correct hypothesis. On the other hand, suppose there is no 
signal; the recipient having ‘‘read’’ a nonexistent message may 
mistakenly take noise for signal. To havethe probabilities of both 
errors small at the same time is impossible. It is natural there- 
fore to fix the probability of mistaking noise for signal (i.e., the 
probability of error of the first kind) and to attempt to choose the 
critical region in such a manner that the probability of loss of 
signal (i.e., the probability of error of the second kind) be a 
minimum, The solution of this problem is dealt with in Neumann 
and Pearson’s theory. 

Thus there are two alternative hypotheses: one of them, H), 
consists in assuming that sampling has been carried out from a 
general set with probability density p(X; a), i.e., thatthe param- 
eter a, which uniquely defines the probability density, has the 
fixed value a; the other hypothesis, H,, assumes that the param- 
eter is equal to a, Let us fix the probability, p,, of error of the 
first kind. We shall define the critical region as the set of points 
X that satisfy the inequality p(X; a,) > cp(X; ag), wherethe constant 
c, which we shall term in the sequel threshold, is found from the 
condition that the probability p,; be equal to its assigned value. It 
may be shown that such a choice ofthe critical region is optimum 
in the sense that the probability of error of the second kind will be 
a minimum. 

Let us introduce the notation 


__ Pf (X; a) 
HA) ~~ p(X; a)’ 
which in statistics is calledthe likelihood ratio. Then the optimum 
criterion for deciding between the two hypotheses (this is the con- 
tent of the basic theorem of Neumann and Pearson’s theory) will 
be expressed thus: if 


U(X) >e, 


the hypothesis H, is accepted; in the opposite case the hypothesis 
H, is accepted. 

The probabilities of errors arising as a result of adopting this 
criterion are expressed by integrals 


p= f pXadX pu= f piX%apaxX, (6.17) 
i(X)>e UX)<e 


in which the domain of integrationis the subset of the correspond= 
ing sampling space defined by one of the inequalities 1(X)> c or 
1(X)<c. The order of multiplicity of the integrals is equal, of 
course, to the number of dimensions of the random quantity X. 
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We shall consider the application of Neumann and Pearson’s 
theory to the problem of detecting a signalon a noise background. 
The recipient of the message has available n voltage amplitudes 
on the basis of which, by knowing the statistical characteristics 
of signals and noise, he has to establish whether useful signals 
are present or absent. The best method of doing it consists, ac- 
cording to Neumann and Pearson, in substituting the observed 
values of the amplitudes in the likelihood ratio and comparing its 
value with the threshold c. This means thatthe optimum receiver 
must comprise a device to accomplish on the input voltages 
the operations indicated by the functional form of the likelihood 
ratio. As a result of using this algorithm, and for a given proba- 
bility of error of the first kind (which in technical applications is 
called the probability of false alarm), we reduce to a minimum 
the probability of error of the second kind (i.e., the probability of 
mistaking signal for noise). We usually reckonnotin terms of the 
latter probability but in terms of the probability of the opposite 
event: the detection of signal. The probability of false alarm and 
the probability of detection will be denoted F and D respectively. 
They are expressed, in terms of the probability densities of noise 
Py(X) and of signal and noise combined P,,(X), in the following 
manner: 


F= f Py(X)dX, D= ff Psy(X)ax. (5.18) 
1(X)>e i(X)>e 


The procedure described gives for a given probability F the highest 
value of the probability D. 

An assessment of optimum methods of extracting signal from 
noise requires the evaluation of these probabilities; this need is 
even greater when, owing to the complexity of the optimum 
method, one has recourse instead of it to different methods. The 
difficulties of evaluating F and D arise mainly from the complex 
structure of the region of integration, which is defined as the set 
of points for which the likelihood ratio /(X) is larger than the 
threshold c, as well as from the high order of multiplicity of the 
integrals. 

We have considered above a number of typicalforms of proba- 
bility densities of the voltage amplitudes. We shallnow write down 
the corresponding likelihood ratios, which will enable us to il- 
lustrate typical features of the integrals expressing the probabili- 
ties F and D. We shall firstly isolatethe parameter of the proba- 
bility density the values of which distinguish the various hypotheses 
from each other. In the case of random signal we shall take for 
such a parameter the quantity g, namely the ratio of the standard 
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deviations of signal and noise, o, anda,, (briefly the signal-to- 
noise ratio); if however the signal is not random, the parameter 
will be s, namely, the ratio of the signal amplitude to the standard 
deviation of noise. Then the hypothesis H, consists in assuming 
that there is no signal g=0 (or s=0), and that the voltage observed 
is caused by noise alone. According to the alternative hypothesis 
there is signal, which is characterized by a fixed value of the 
parameter: g, for a random signal and s, for a nonrandom one, 
Actually the value of the parameter is, as a rule, unknown to the 
message recipient, and therefore, strictly speaking, we have to 
test the simple hypothesis H,{qg=0} against the composite one 
H,{g>0}. The corresponding theory, however, has been little 
developed and is inapplicable to many types of signals. The fol- 
lowing argument provides justification for using, in such situa- 
tions, the theory of simple-hypothesis testing. Let the minimum 
acceptable detection probability D, be prescribed on the basis of 
various relevant considerations (its value varies from 0.1 to 0.99 
in various applications). We shall term threshold signalthe value 
of the signal-to-noise ratio that ensures this probability and shall 
denote it by ¢g, (or, for a nonrandom signal, by s,). If now the 
receiver is so designed that it best realizes this detection proba- 
bility, the corresponding threshold signal will be a minimum.* 
For large values of the signal-to-noise ratio (and only these are of 
interest here) the incoming signal is more intense and the minimum 
acceptable detection probability D, is ensured although the receiver 
does not process such signals optimally. The above considerations 
are of an intuitive character and, of course, do not provide a 
rigorous proof of the possibility of using the theory of simple- 
hypothesis testing. However, as experiments show, they are 
adequate for certain engineering applications. 

The determination of the threshold signals is not the only 
computing problem of detection theory. Of no less importance is 
the plotting of “detection curves,’’ i.e., of the detection proba- 
bility D as a function of the signal-to-noise ratio. To do this, 
having chosen a receiver that best ensures the detection of signal 
with false-alarm probability equal to an assigned value D,, and 
having thus fixed the parameter q, (or s,), one considers the de- 
tection of signals the probability densities of which are charac- 
terized by different values of the parameter g. In a number of 
elementary cases such functions D=D(q; F) can be obtained ex- 
plicitly. Many problems, however, require a large amount of 


*The minimization of the threshold signal is important in engineering, since it 
enables us to reduce the rn, hl expended in the transmitter for ee the oscilla- 
tions carrying the useful signal, 
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computation for obtaining numerical solutions. Examples of such 
problems will be repeatedly met in the sequel. 

Let us explicitly introduce the parameter g (or s for non- 
random signals) in the notation for the probability density of signal 
and noise combined, so that 


Psy (X) = p(X; 9); 
then the probability density of noise alone will be 


Py (X) = p(X; 0). 


The likelihood ratio will also depend on the signal-to-noise ratio: 


‘X; 
1G = a o 


We shall begin our enumeration of likelihood ratios with the case 
of noiselike signals described by the probability density (5.10). 
By taking the noise variance as unity, we express the variance o? 
in the form 


o = 1-+ a29?, (5.19) 


where a, is the ‘‘envelope’’ of the sequence of amplitudes of the 
useful signal, Then the probability density of the amplitudes of 
signal and noise combined is equal to 


2 
r 
Ps k 


——- 
P(ty 200s Ta DS reg *(i+9h¢°) | 
' ; I Gear, 


and the probability density of noise alone is equal to 


(iii teio—————— 


1 
7m é 
e+e 


(5.20) 
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The hypothesis of existence of signal is accepted if the likelihood 
ratio satisfies the inequality 


LC sees Tey QS i 


By passing to logarithms and carrying out elementary transforma- 
tions, we arrive at the equivalent inequality 


aot — acs In (1+ 429”), (5.21) 


k=1 if k=l 


The right-hand side of this inequality will be denoted, for 
brevity, ¢,. 

Thus the detection of noiselike signal is best accomplished by 
means of a (weighted) sum of the squares of the amplitudes. Such 
processing of signals is called square-law processing. The proba- 
bilities F and D are expressed by improper z-tuple integrals taken 
over the region external to the ellipsoid 


1.6.; 
a the 
F=f ‘igi f Tne? ar, dr,,; 
3 k=l 


3 
p= .. STL e 249) Gr, we. Uae 
R= Uae 


The threshold ro q, and the corresponding threshold level c 
ensuring the assigned false-alarm probability must be found by 
solving the system of equations 


Fy c)=F, Dg c)= 2D). (5.22) 


In order to determine the detection probability D as a function of 
the signal-to-noise ratio q we need to evaluate the integral 


2 
Tr 


. * 2 2 
IG n o=f ve ST Fee +E) ar... dr, 
3, ear 1,9 
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We observe that 
BO O=F; DO a5 O=D,. 


It can easily be verified that the above integrals can be ex- 
pressed in terms of tabulated functions, as combinations of x?- 
distributions with an even number of degrees of freedom. If the 
signal envelope is constant (i.e., a, a,forallk and), the proba- 
bilities D and Fare expressed by the x?-distribution with 22 
degrees of freedom. 

Therefore, the use of the Monte Carlo methodfor evaluating the 
characteristics of square-law processing is not advisable. How- 
ever, owing to the fact that a simple analytical representation of 
the required probabilities is available, this problem canbe used to 
assess the power of the Monte Carlo method: to verify the quality 
of pseudo-random numbers, to estimate the rate of convergence, 
etc. 

Let us now consider the detection of signals that differ from 
one signal element to another by a constant factor a,, the noise 
voltages in different elements being uncorrelated with eachother. 
The probability density of the amplitudes of signal and noise com- 
bined is equal, for an n-element message, to 


Pry 003 Ty N= 
eerie 
tae Mt f Se \O 4S TY (Bs) as 
res oe y I : ay : 


while the probability density of the amplitudes of the noise voltages 
alone is equal to 


P(ry ses Pq O=[[ Be ™. (5.23) 


The likelihood ratio is expressed by the integral 


Urry. Ta O= , 
e, GBS) Fie in 
[ae ( nes s) ds, (5.24) 
ry =1 
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The optimum receiver, according to (5.24), evaluates for each 
received pulse the functions to( ts) jJ=1,..., 2 and then 
c 
J 


a s 1 ae s2 
averages their product with weight | exp _— (44 D+) = \. 
The errors arising in sucha method of reception are characterized 
by the probabilities F and D, equal respectively to 


n -+y 
F= fi... f [[ be "an... dry 
U(tys e009 Tai a)>e hal %R 

ee 
ae lp 207 

D= tees IT Fe k dry... dr, 

L(Fys see Ty Q) Oe 
P= 1 . a s3 
s bal ty, (ry 
x [ae hot TT (sas 
f=1 


Just as in the case of square-law processing, the threshold signal 
q, and the threshold level ¢ are determined by the system of 
equations (5.22). The detection probability D as a function of qg is 
found by evaluating the integral 


DOGO foe f PU preety Ndr... dry (5.25) 
E(ry, ese Pai I) > 


In this case, it is no longer possible to reduce the calculations 
needed to elementary or tabulated functions, while the use of 
quadrature formulas is made difficult by the high order of multi- 
plicity of the integral. Thusthe Monte Carlo method emerges here 
as the only acceptable method of computation. 

Let us consider the form of the likelihood ratio for arbitrary 
correlation between the components of the signal and noise 
voltages, when the successive signals in the message are con- 
nected not functionally but stochastically. The corresponding joint 
probability density of the amplitudes has been obtained in Section 
1, the formula (5.15). We shall qualify with an index 1 quantities 
corresponding to signal and noise combined and with an index 0 
quantities corresponding to noise alone. Then the likelihood ratio 
is found to be equal to 
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Ve gE (Che Pee)" 


Lp see i N= 
Mi (—1)2 Det il at Lei vet (Shar a1) £08 (SD vps) 
‘Ss (—1= Let It pt 3 1, (Our ers) cos (> > va) 


In the particular case when the noise voltages are independent, 
this expression is somewhat simplified 


(ry, oe ik, ra Om age tee ‘rR ye 1h 2° kt 


x Ul IT 54 v9 Oral ed) COS (> b> vi) 


The false-alarm probability F and the detection probability D are 
expressed by means of the corresponding probability densities in 
the usual manner. 

Let us pass to the other method of signal detection in noise. 
This is based on sequential analysis, a development of mathe- 
matical statistics, and has attracted much attention in recent years; 
a great number of papers has been devoted to its application to the 
group of problems of interest to us. In sequential analysis, in 
contrast to Neumann and Pearson’s theory, the amount of sampling 
used to test a statistical hypothesis is not fixed in advance, but is 
determined as a result of the test carried out. The advantage of 
this method is due to the reduced number of tests needed, which 
leads to economy in the time spent on the experiment. 

The practical realization of the method of sequential analysis 
in detection problems consists in the following: the hypothesis H, 
is that signal is absent, and therefore the signal-to-noise ratio, 
i.e., the parameter on which the probability density of the voltage 
amplitudes depends, is equal to zero, and the alternative hypoth~ 
esis H,, that the voltages observed are caused by the combined 
action of noise and signal, the latter being characterized by a 
fixed value of the signal-to-noise ratio. Let us denote by 


Pir; q) Py(rys 9; 9), eoen Pi(ty sees Fry q): “ae 
and : 
Py (ry; 0), Po (ry. 7o3 O)s eee Pa rys esos Pps 0), - es 


the k-dimensional joint probability densities of signal and noise 
combined and of noise alone. The likelihood ratio will be denoted 


by 
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Py (Pyro 003 TH 
Ate 3200 = aos Te A , 


Let us assign the positive numbers A and B(A>8). The tests to 
establish which of the two hypotheses H, and H, is valid will be 
carried out in the following manner. As a first step, having 
measured the amplitude r,, we evaluate J, (r,; ¢). 

If it is found that /,(r,; ¢) >A, we accept the hypothesis H,. If 
the inequality B >1,(r,; 9) is satisfied we accept the hypothesis H,. 
Finally, if B < l,(r;; g)< A, the decision as tothe validity of one or 
other hypothesis is postponed, and further tests are carried out. 
This means that the next amplitude 7, is measured and /,(r;, ro: 9) 
is evaluated. Again there are three possibilities 


Liry 73 D> A. bry rg) < Band B< (ry, 193g) <A. 
According to which of these inequalities is satisfied, we proceed 


in a similar manner to the above, If at the (— 1)th sequential 
test no decision has yet been reached, i.e., if 


BoA Cp oven gi QCA a=1, 2,.,.,%—1, 
then, at the Ath step, we add to the amplitudes r,,r,,..., r,-, the 
amplitude r, and evaluate /,(r;,.... 7,3 g) which is again compared 


with both thresholds A and 8. The thresholds A and B are so 
chosen that the probabilities of errors ofthe first and second kind 
(i.e., the probability of false alarm and the probability of signal 
rejection) are equal to fixed values; in fact 


D 
A= B= 


It can be shown that for many important types of distribution the 
probability of the process of sequential testing ending with the 
acceptance of one of the two hypotheses is equal to unity. This is 
true, in particular, of independent observations, when the k- 
dimensional joint probability density is expressed as the product 
of the probability densities of the individual arguments: 


Pury coos Tes DHPC DN Py (ro G) «++ Piles 9)- 


In connection with the fact that sequential analysis is used for 
reducing the amount of sampling (or else, which is the same, the 
observation time), the main characteristic of performance of the 
method is the probability distribution of the serial number of the 
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end step of the process, i.e,, the distribution of the serial number 
of the step at which, as a result of analysis of the amplitudes 
observed, one of the two concurrent hypotheses is accepted, It 
suffices in certain cases to restrict ourselves to the first two 
moments of this distribution. 

There are four possibilities of ending the sequential analysis 
at the vth step: the correct hypothesis H, is accepted or is re- 
jected, the correct hypothesis H, is accepted or is rejected. Let 
us denote the corresponding probabilities by the symbols 


Po; 9), = pgs 49) 
for the first two possibilities and 


Pg @)> Py 9) 


for the last two possibilities, and let us recall that according to 
the hypothesis H, a signal of intensity q is present in the voltage 
observed. 

Let us consider in greater detail the events the probabilities 
of which are described as po(v; 0) and po(v; g). The occurrence of 


any of these means that the sequence of amplitudes r;....,. r, with 
joint probability density P,(r,,...1 7,3 9, #=1,...,¥, is such that 
the following inequalities are satisfied: either 
BU (rys wees Pe QCA, k=1,..., v—1 
and 
Apes ty QB 


(when the first of the events is realized), or 


BoA ee EE DCA R= 1... vd 
Lp. Ty Qo>Aa 


(when the second event is realized). The meaning of the events 
having probabilities p,(v; 0) and p,(v; g) may be illustrated ina 
similar manner. 

In order to clarify the typical difficulties that are met in evalu- 
ating the above probabilities, we shallconsider the simple version 
when the amplitudes r,, 79, ..., 7,5 +.» are independent and the like- 
lihood ratio is expressed in the form 


hp wc ty = Ths (res 9): 
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It is convenient, in this case, to pass to logarithms, as a result 
of which the process of sequential detection reduces to adding the 
random quantities ¢,—Ini,(r,;q)- If the following inequalities 


nB<O+O+... +0,<indA, k=1,..., v—1, 
are satisfied at the first v-1 steps, and one of the inequalities 


C+... +C,>mA=a, 
Ct... +O, <inB=s, 


is satisfied at the v-th step, the process ends with one of the two 
hypotheses being accepted. Owing to the statistical dependence 
between the sequential sums (,+ ...-+¢, and 4+... +,, m> 
n==1, 2,..., the calculation of the probabilities of interest to us 
reduces to evaluating n-tuple integrals. 

We shall write down, for example, the formulas for the prob- 
abilities of the serial number of thefinal step of sequential analy- 
sis for a noiselike signal, To make matters simpler, we restrict 
ourselves to the case of a constant signal envelope 


Rk 
2 
Indy Op oscn Tei D= sq ogy Ik +e 
i=1 


The upper and lower thresholds of the quantity } ind, are expressed 


in terms of A and B and, after elementary transformations, are 
found to depend on the serial number of the test 


TE lin Ape -In(l+-@h 


b,=2 =e [InB+-&-In(i+q)], 


a,=2 


so that the process of analysis is continued until the inequalities 


k 
n<Di<a, 


are satisfied, and is ended with accepting the corresponding hy- 
pothesis, when at the vth step the quantity In/,(r,. 7.,..., r,; g) lies 
outside the limits of the interval (6,, a,). The quantities r? are dis- 
tributed according to the exponential law 
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1 = 
room| a" 4, #30, 
0. x<0, 


where d= 1ifonly noise is present, and d=1-+q? if there is signal 
and noise combined. From this we obtain an expression for the 
probability of ending the process at the vth step by accepting the 
hypothesis H, of presence of signal, p)(»; 9) or p,(¥; 9): 


Po (vi 9) } 


Pq (%; 9) 
© @,_,-(%,+ oes +%,_3) 


P(x,) dxe f P (X,-1) dxj-1 wee 
B— (Ft F4y 1) by (Ft see #4) _ 9) 
Ay *1— Xy @,—2, a, 
P(x)dx, [ P(x) f P(m)dx, (5.26) 


b3-X)—Xq bax, by 


In a similar manner we can express the probability of accepting 
the hypothesis H, of absence of signal. Itis evident that the evalu- 
ation of such integrals does not involve any difficulty of principle, 
but it becomes extremely laborious as the number na increases. 
For other distributions such caleulations may prove impossible to 
perform. We shall consider, for example, the investigation of the 
probability distribution of the quantity v for adeterministic signal, 
described (when combined with noise) by the probability density 
(5.11). In this case, the form ofthe integrals (5.26) becomes sub~ 
stantially more complicated. In fact, the likelihood ratio at the 
first step has the form 


lv; s)= e Tire), 


where the amplitude 7 is distributed according to Rice’s law 


ee 
p(x) xe 7? Ih(rs), 


if a signal has been transmitted, or according to Rayleigh’s law 


= 
p(x)= xe ?, 


if noise alone is present. 
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Tests are continued until the sum 
In Jo (rs) + Indy (ros) + ... Indy (7,8) 


lies beyond the limits of the interval (4,, a,) where 


s? 
a,=In A+k->> 


2 
b, =n B+k->. 


The probability density of the quantity (= In/,(rs), wherer is dis- 
tributed according to either the Rayleigh or the Rice law, can be 
obtained in principle on the basis of known rules for evaluating the 
distribution function of a random quantity. Clearly, however, we 
cannot expect to be able to write down explicit analytical expres- 
sions for these probability densities. Therefore the evaluation in 
closed form of the distribution of v is impossible and only the 
Monte Carlo method enables us to obtain numerical results. 

We can calculate in a similar manner the characteristics of 
sequential detection of other types of signals. 


3. PROCEDURE FOR THE SOLUTION OF THE MAIN 
PROBLEMS OF DETECTION THEORY 


It has been shown in Section 2 that the solution of the main 
problems of detection theory reduces to evaluating multiple in- 
tegrals. 

We shall consider, for example, the method of calculating the 
detection characteristics of a noiselike signal in the case of 
square-law processing, The probability of false alarm F and the 
detection probability D are expressed in the following manner: 


F= (0; 4; ¢), 
D=9@G;%;¢) g>0, 


where 
th 


DG: 1: sil obits. =P e 2litaze?) ar ps 


k=1 


the region of integration is 
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a 2 
a 
= . 2 
26—{ru seer Tai Thee A>: 
k=1 & 


and the limiting threshold c,, is equal to 


Cy, = - E +2 In(l+ a 


The evaluation of such integrals by the Monte Carlo method is 
accomplished by simulating a sequence of series of Rayleigh’s 
pseudo-random numbers 


Tip Tig» eoer Tint 
Toys (oq: ves Tops 
Tru Tn: eens nua 


either with parameters 1-+ a2q? (for signal andnoise combined) or 
with parameter 1 (for noise alone). From the elements of each 
row, which simulate the amplitudes received, we form the sums 


bi 2 
a 
¢ = —_*__, r2 ‘ 
é > 1+ aag? ik 
which are compared with the threshold c,,. Ifthe inequality 


Gi >Cq, 


is satisfied, unity is added tothe sum» in the counter of success- 
ful tests. After repeating this procedure N times, the required 
integrals are identified with the quantity 


’ \ 


v° 


The method can be simplified if we recall that the square of a 
Rayleigh-distributed quantity has an exponential distribution. 
Therefore, by generating pseudo-random numbers with this dis- 
tribution, we avoid the operations of root-extraction and squaring 
to be carried out on each ‘‘Rayleigh’’ number, What has been said 
concerns the method of evaluating a single integral denoting either 
F or D. In reality, one has to find many such integrals, since in 
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the majority of problems one has to find a threshold ¢ such that 
the probabilities F and D are equal to their assigned values. Let 
us first consider the simpler case where the expected signal gq, 
and the probability of false alarm F are fixed. To this end, an 
electronic computer is used, as is indicated above, to generate 
the random numbers 


a 
c= Lipa T+aag? 2 re 


where the 7? are distributed according to an exponential law with 
density 


pase -¥ 


and represent the squares of the noise amplitudes. Each of the 
numbers ¢ is entered in a histogram and after a sufficiently large 
number of tests, when the form of the empirical probability den- 
sity has been determined with acceptable accuracy, one determines 
the threshold cz, (and with it also c) which is such that the proba- 
bility that the quantity ¢ exceeds itis equal to the assigned proba- 
bility of false alarm F. After this, it remains to evaluate the 
probability of detection Q(g; q,; ¢) in the region of values of the 
parameter g which have been considered, and in particular for 
qg=q,. By virtue of the fact that the threshold ¢,, is determined 
from a histogram and not by directly solving the equation 


Tx (x)dx =F, 


a 


the expenditure of machine time is reduced. 
Let us consider now the method of calculation of the threshold 
signals. The probabilities F and D are assigned: 


F=9(0; 9; ¢), D=@(q M4: ©) 


and it is required to find the corresponding values of ¢,; and ec and 
then to calculate (q; q,; c). In comparison with the previous one 
this method presents considerable difficulty, owing to the fact that 
the likelihood ratio /(r, q,), characterizing the region of integva- 
tion in both integrals (for F and D), depends on the unknown 4). 
Therefore the solution of the system of equations determining g, 
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and ¢ is accomplished by atrial method: one assigns a value of 
g, and makes up a histogram of the quantity ¢ (simulated on the 
assumption of the presence of noise alone), from which the 
threshold level c, (and with it also c) is chosen. Then 9(q;; 91; ¢) 
is evaluated; if the result is found to be larger (smaller) than the 
assigned probability D, q¢, is increased (decreased). Of course, 
this process is very laborious and a considerable amount of ma- 
chine time is required to achieve satisfactory accuracy in the 
determination of g, and c. In particular cases, for example for 
a noiselike signal with constant ‘‘envelope’’ (a,—a, forall & and 
!), the likelihood ratio does not depend on g, and then, having de- 
termined the limiting threshold, we find the threshold signal-to- 
noise ratio by solving the equation D(q,; 9,; c)=D. 

We have considered above the calculation of the threshold sig- 
nals and of the detection characteristics for the case of square-law 
processing of noiselike signals. The main features ofthe applica- 
tion of the Monte Carlo method are retained also with other types 
of signals, the only difference consisting in the methods of simu- 
lating the corresponding random quantities. These methods are 
considered in Chapter VII, in which Section 6 is devoted to multi- 
dimensional distributions met in detection theory. If, for example, 
the noise and signal components are correlated, thenthe calcula- 
tion starts by constructing the matrix A which transforms inde- 
pendent normal quantities into correlated quantities with assigned 
covariance matrix 9. The successive independent normal quan- 
tities, transformed by this matrix, are used to find the ampli- 
tudes; these are substituted in the likelihood ratio which is 
compared with the threshold, etc. 

Finally, let us consider how the characteristics in sequential 
analysis are calculated. As we have seen in Section 3, the object 
of the calculation is to findthe probability distribution of the num- 
ber of steps needed to end the process. This is achieved by simu- 
lating the sequence of random quantities 


Ty los ones Tye woe 
with joint probability densities 
PrP) e Py (ly Tahs wis Pg Ge vase Tes ove 


of one or two types: either concerning signaland noise combined, 
or noise alone, and by calculating the frequencies of the events 
consisting in the following inequalities being satisfied 
BLU HISA. BHleicne—1. 
L(fy sso My) >A, 


or 
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hice PB. 


This means that at each step of the sequential analysis the calcu- 
lated value of the likelihood ratio is compared with thresholds A 
and 8, and when it lies beyond the limits of the interval (8, A) the 
serial number of this step is entered in the histogram. Then the 
process is repeated from the start. In the case of independent 
amplitudes, the probability is unity that the analysis ends after a 
finite number of steps (a similar statement would not be true for 
correlated amplitudes). It may be found that the working memory 
of the machine is not sufficient for the histogram. To avoid this 
difficulty, one can assign a serial number y, and assume that all 
tests that continue further than the (y,—1)th step have ended at 
the v th step. Then the calculated probability p,, is the sum of 
the probabilities of ending the process at the vth,(vy-+ 1)th,... 
steps. 


4, OTHER PROBLEMS 


In information-transmission theory the Monte Carlo method 
can be used to solve a wide groupof problems in addition to those 
considered above. The most typical of them willbe discussed be- 
low. 


Detection of a Target and Determination of Its 
Position by Means of Counters of Quantized 
Signals [48b] 


Let us consider~the problem of radar detection of a target on 
an azimuth scan (on a sector at a fixed distance from a search 
radar). In a region not containing the target only Rayleigh- 
distributed noise pulses are present. The amplitudes of signals 
reflected from the target are assumed to be constant (the signal 
“does not fluctuate’’) and therefore the amplitudes ofthe resultant 
of signal and noise will be independent and distributed according 
to the Rice law. 

Suppose the receiver accomplishes quantization, i.e., its out- 
put is aunit signalifthe input voltage exceeds a level x, and a zero 
signal if the input voltage does not exceed x). As a result an azi- 
muth scan is found to be filled with a sequence of zeros and ones. 

Let the probability of occurrence of a noise pulse be denoted 
by py. By taking the variance of noise as the measurement unit, 
we have 


v3] dio 


co x = 
Py= f xe Fdx=e?. (5.27) 


x 


| 
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The probability of occurrence of asignalpulse, p,, depends on the 
signal amplitude and is equal to 


on xi+st 
ps(s) = f xe 2 J, (sx)dx. (5.28) 


Xx 


Since po> p,, the occurrence of target will result in an increased 
number of ones in the corresponding region of the azimuth scan. 
Several criteria based on this consideration may be suggested. 
We shall indicate two of them. 

a) Series type criterion. A counter of pulses is placed at the 
receiver output. As long as there are only zeros on the scan the 
counter remains set at zero. When the first one occurs on the 
scan, a unit is sent to the counter. Additional rules will be for- 
mulated by denoting by p, the outcome of a binomial test at the 
kth position of the scan, and by /, the value registered by the 
counter corresponding to the &th position. If one of the conditions 


either pq _3Mq_og—1_ = 0100, 
OF PB y-2bhg-1ha = 000, 

2.3<1,<6 and p,_.,_y%, = 000, 

3.6<1, ANd py _3q_oMp—ita = 0000, 


1. O<E 3 and { 
(5.29) 


is satisfied, the counter is returnedtozero(!,,,=0), i.e., the end 
of the region of signals reflected from the target is recorded. 
For the remaining combinations of zeros andones on the scan and 
values of the counter, one unit is addedto latter, i.e., J,,;=4,+1. 
The target is assumed to be detectedwhen/, >6. The criterion of 
beginning of the-signal is 1,—1 andthat of end of the signal region 
is /,>6,/,,;=0. The target azimuth is determined as the half- 
sum of the azimuths corresponding to the beginning and end of the 
signal region. 

b) Sequential-observation type criterion. At the initial position 
the counter is set at zero. If at a certain position a one occurs, 
4 is added to the counter; if a zero occurs one unit is subtracted. 
It is assumed that if /,>16, then /,.,—0 and the target is con- 
sidered detected. If 1,<—4, then /,,,=0 and a target-absent 
signal is generated. The target region is measured from the 
first pulse obtained from the target to the first pulse indicating 
absence of target. 

To determine the performance of both criteria we have to 
evaluate the detection probability and the false-alarm probability, 
and also find the distribution of the measurement errors of target 
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azimuth (from which the error expectation and variance are found). 
The detection probability (and the false-alarm probability) forthe 
series type criterion can be determined analytically by solving a 
system of recurrent equations with respect to the quantities p*, 
denoting the probability that at the instant nthe value & is recorded 
in the counter. 

The error distribution cannot be evaluated by elementary 
methods. The use of the Monte Carlo method enables us to solve 
both problems simultaneously. 

Electronic computers are usually employed for the solution of 
these problems. Locations of the machine memory are allocated 
for storing the content of the positions of an azimuth scan: ones, 
if signal (useful or noise) is present, and zeros in the opposite 
case. The method is implemented inthe following manner: K loca- 
tions, allocated for azimuth scan, are filled with zeros or ones 
according to a binomial distribution with parameter py in a noise 
region and parameter poinasignalregion.* The realization which 
is obtained of the observed scanis analyzedby means of the corre- 
sponding criterion, i.e., if the presence of target is recorded, 
then its azimuth is determined and is comparedwith the true one. 
The value of the error is enteredina histogram. By repeating the 
simulation process many times we can determine the required 
characteristics to any desired accuracy. The correctness of the 
calculation may be partially checked by comparing experimental 
detection probabilities with theoretical ones obtained by solving 
a system of recurrent equations for series type criteria. Such a 
check will provide evidence of the quality of the binomial-distribu- 
tion model. 

Calculations of detection characteristics by these methods 
ordinarily require the execution of about 1000 tests for a single 
version (see [48b]), 


Estimation of the Azimuth by the Maximum- 
Likelihood Method 


The joint probability distribution of the sequence of voltage 
amplitudes at the receiver output depends, for a fixed distance, 
on the target azimuth a, i.e., can be written in the form p(r,,.... 
r, %. Therefore, the problem of measuring the target azimuth 
may be formulated as that of determining the unknown parameter 
of a distribution. A method for solving this problem, called the 
maximum-likelihood method, has been developed in statistics. 


*We assume that a signal region is bounded on both sides by noise regions, 
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In this method the adopted estimate for the unknown parameter 
ob b 
Ti, 


is that value @ of the parameter which maximizes p( ss Fu Boe 


Here ro ..., ro are the observed values of the corresponding ran- 
dom quantities (in our case the voltage amplitudes). We shall as- 
sume that the radar receiver comprises a thresholddevice, which 
quantizes the signal into two values, and that the signal itself is 
noiselike, i.e., its amplitudes are Rayleigh-distributed and un- 
correlated, Then the probability of occurrence of the &th pulse 
in a “‘packet” of reflected signals is equal to 


2 
— *o 5.30 
a iti 


where x, is the quantization level, g is the signal-to-noise ratio, 
g(a) is the normalized [g(0)=1) radiation pattern of the antenna, 
6, is the azimuth of the &th pulse, and a is the true target azi- 
muth. We shall again denote by ». the result of quantization of the 
amplitude in the sth position, which can be either one or zero. 
The equation for finding the target azimuth by the maximum- 
likelihood method has the form 


2 tei @— 9%) =0, (5.31) 


where 


& (x) £' (x) 


n(x) = xe 
testo |e |g ao | 


are weighting coefficients depending on the estimate a, In order 
to find the required azimuth estimate, we must move the set of 
‘‘weights’’ y,—7(a—0,) along the sequence of zeros and ones ob- 
tained as a result of quantization of the signals in the considered 
sector of the scan, i.e., having affixed the weights 7,,..., 7, to 
the extreme left elements ofthe scan, we build up the sum of those 
weights against which there is a one, and then move the weights 
by one position to the right and again find the weighted sum, etc., 


a 
until a minimum (close to zero) of the weighted sums © p,, (a — 04) 
k=l 


is attained. The azimuth of the position corresponding here to the 
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weight (0), i.e., having argument zero, is taken as the target 
azimuth. 

To calculate the efficiency of the maximum-~-likelihood method, 
it would be necessary to find the probability distribution of the 
estimate a. The analytical solution of this problem is clearly 
faced with great difficulties, and therefore one naturally has re- 
course to the Monte Carlo method. Calculations by the Monte 
Carlo method proceed according to the following scheme: azimuth 
scanning is simulated by means of a sequenceof zeros and ones,* 
then weights {n,} are ‘“‘superimposed’’ on this sequence, and the 
root of Eq. (5.31) is found and is entered in a histogram. It is 
shown in [48b] that 100 such tests sufficed to construct the prob- 
ability distribution function to sufficient accuracy. 

We observe that by the same method we can determine the 
azimuth-measurement accuracy in the case of other types of signal 
fluctuation. 


The Probability Distribution of Noise Power at 
the Output of a Radio-Relay System ([54, 55]) 


The performance reliability of a communication radio-relay 
system is determined by the total intensity of the noise voltages 
arising in various sections of the system betweentransmitter and 
receiver stations. Therefore, the correct designof sucha system 
requires a knowledge of the statistical characteristics of the total 
noise and, in the first instance, of the probability distribution of 
noise power at the system output. Of special interest is the be- 
havior of the distribution function for large values of the argu- 
ment (for engineering purposes it is required to estimate the 
threshold noise power that is exceeded with probability equal to an 
assigned small number p,,,—0.01; 0.001). 

The mathematical problem is formulated in the following man- 
ner: asequence of random quantities §,, & .-., §& with distributions 
F,(x), F,(x), -.., F,(x) is given. Itis requiredtofindthe distribution 
function of the sum 


Rte eee the 


Owing to the large number ofterms init, it would be natural to use 
limit theorems of probability theory. However, this, atfirst sight 


*The probability of occurrence of a one in the &th position is determined by 
a 
2 
0 


formula (Section 30); the probability of a noise is Py=e 2. 


4, OTHER PROBLEMS 193 


obvious, approach is facedin anumber of cases with serious com- 
putational difficulties. This is explained by the fact that the limiting 
distribution is found sometimes not tobe normal. We shall restrict 
ourselves to the simple case when the random quantities & are 
independent and identically distributed. 

According to experimental data the distribution functions are 
of the form 


1-24) xe> 9(x), 
F(x)= x (5.32) 
0, x* < o(x), 


where a(i<a< 4) is a parameter and 9(x)—c(>0) for x->oo. It 
is shown in probability theory that the limiting distribution of the 
sum of independent random quantities distributed according to one 
and the same law (5.32) proves tobenormal only in the case when 
a>2. It is evident that the calculation ofthe threshold power is in 
this case elementary. Difficulties occur when the parameter «a 
Satisfies the inequality «<2. The limiting distributions are then 
found to belong to a class of stable distributions of which is known 
only the form of the characteristic function 


fa) = exp[itt—elt|*{1-+88 Wee: a)}], 
where |§|<1, e>0 and 


tan 5% aX, 
o (é, a) =: 9 
= log|t|, a= 1. 


We can easily write down the distribution function by representing 
it as the Fourier transform of the characteristic function. How- 
ever, the calculation of the probabilities of large deviations re- 
quires the evaluation of integrals of very rapidly oscillating func- 
tions. In this case, even by using electronic computers, it is 
difficult to obtain a satisfactory answer. 

The use of the Monte Carlo method enables us to obtain the 
required results by a reliable, standard method: a sequence of 2 
random numbers §,, §,..., §, distributed according to (5.32) is 
generated and the sum 


ae ee 
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is entered in a histogram. According to [54] and [55], to ensure 
an acceptable accuracy in the determination of the distribution 
function as regards its tails (we have in mind arguments x, for 
which p (8 > Xo} = Piyresholg) NOt less than 5000 tests are required. 

The probability distribution of the components is assumed most 
often to have the form of Pareto’s law 


Pats 
F(x) = - 


If the noise voltages of adjacent sections are not identically 
distributed (and are possibly correlated), then by introducing ap- 
propriate modifications in the procedure for simulating the ran- 
dom quantities, we can find the probability distribution of their 
sum © We can similarly consider also more complicated func- 
tions of &, .... &, 


Methods of Decoding Messages [8] 


Before a message is transmitted, it has to be coded, At the 
receiving end of the line, to determine the content of the signal 
received, the latter has to be suitably processed (decoded). The 
coding may be carried out in the individual letters of an explicit 
message (that is, each ‘‘word’’ formed by 2 code symbols denotes 
one of the letters of the alphabet). In other cases ‘‘words”’ of n 
code symbols are complete messages (thus it is possible to trans- 
mit ;”" different messages, if 7 is the number of values a symbol 
can assume), Noise present in the communication line and in 
the receiver distorts the code symbols, and therefore the re- 
construction of the initial message is liable to be in error. 
This gives rise to problems in the design of noiseproof codes 
and in the investigation of reliable methods of decoding. 

We shall restrict ourselves to binary codes, the symbols of 
which assume only two values: 0 andl. These symbols are trans- 
mitted by two electric signals S)(¢) and S,(¢) differing from each 
other in one or several parameters (amplitude, duration, etc.). 
Owing to noise in the communication channel and in the receiver, 
the signals at the receiver output, z(¢), do not coincide either with 
So(t) or with S,(). Each received signal can be identified by some 
method or other with one of the possible signals: either S, or S,. 
In order to detect and correct errors we add, to 2-symbol code 
‘‘words’’ carrying useful information, more check symbols, In 
Wagner’s code this is done in the following manner: the sum of all 
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n symbols in a word is calculated; ifthe sum is even (odd), a zero 
(a one) is put in the (n-++ 1)thposition. Therefore, in each received 
(1++ 1)- symbol word the sum of the symbols must be even. If it is 
found that in the message received this condition is not satisfied, 
a single error can be corrected, for example, in the following 
manner. On the basis of known statistical properties of noise we 
calculate, for each symbol inthe word, p)= p(S,/z) and p, = p(S,/z); 
i.e., the a posteriori probabilities of the occurrenceof S, and S,, 
with the condition that the signal z has occurred at this position. 
We assume that the original signalis S;(é=0, 1)if the a posteriori 
probability p, corresponding to itisthe larger.* For each position 
we work out the difference Ap=p,— p,. We decide on the position 
of the wrong symbol in the ‘‘word’’ received (when the latter has 
odd parity) by considering the set of differences, Ap, and finding 
the position for which |Ap| is a minimum. It is thus possible to 
correct, with a certain probability, single errors. 

Other methods of coding and identifying the received signals 
with transmitted messages are known. The quality of a coding 
method is estimated by the probability oferror in the reception of a 
message, These probabilities can be compared by simulating on a 
computer the transmission of ‘‘words,’’ by using pseudo-random 
numbers representative of transmitted symbols distorted by chan- 
nel noise (to do this the probability distribution of noise must be 
assigned). By repeatedly simulating reception by various methods, 
we can calculate the frequency of correct identification of the re-~ 
ceived message for each of the methods compared. In the case 
when the symbols .are independent, it suffices to carry out the 
simulation with oné ‘‘word’’ only (for example, the word consisting 
of all zeros). 


*We can proceed in this mamner if all combinations of code symbols are per= 
missible and there is no correction between words, 


Chapter VI 


Generating Uniformly Distributed 
Random Quantities by Means of 
Electronic Computers 


1, COMPARISON OF VARIOUS METHODS OF 
GENERATING RANDOM QUANTITIES 


The success of calculations by the Monte Carlo method on an 
electronic computer is determined by two basic factors: 

a) the quality of the source of random numbers, 

b) the choice of a rational computing algorithm. 

The problem of choosing the method of generating random numbers 
is one of paramount importance, since its successful solution de- 
cides in many cases the success of the solution of the whole 
problem. 

The simulation of a given random process requires the possi- 
bility of sufficiently economical construction of sequences of 
random numbers corresponding to some assigned distribution 
laws. We have observed that to obtaina value of a random quantity 
with an assigned distribution law, one or several values of uni- 
formly distributed random numbers are usually employed. There- 
fore the problem of computer generation of uniformly distributed 
random numbers is of particular importance, 

The latter problem can be solved by various methods, 

The first method, one that is widely employed today, consists 
in the following. Random numbers are obtained in a computer by 
programming some recurrence relation. This means that each 
successive number 4,,,; is generated from the previous one a, (or 
from a group of previous numbers) by employing some algorithm 
consisting of arithmetic and logical operations. Such a sequence 
of numbers, without being random, may nevertheless satisfy 
various statistical criteria of randomness, Therefore, suchnum- 
bers are called pseudo-random, 

The main advantages of the programming method of obtaining 
random numbers are, among others, the possibility of checking 
computer operation during the process of solving a problem (the 
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possibility of repeatedly reproducing a calculation) as well as the 
simplicity of the algorithm for generating a pseudo-random number, 

The main shortcoming of pseudo-random numbers is the dif- 
ficulty of a theoretical assessment of their statistical properties. 
This is particularly evident when various multidimensional dis- 
tributions are generated by proceeding from a sequence of uni- 
formly distributed pseudo~random numbers, In addition, all 
sequences of pseudo-random numbers generated by programming 
methods are periodic, and therefore, even from a practical point 
of view, very long sequences will not be random. Several methods 
of ‘‘improving’’ sequences of pseudo-random numbers are avail- 
able today and will be discussed indetail below. However, the use 
of these methods can somewhat reduce the speed of operation of 
the computer, 

Methods of obtaining uniformly distributed pseudo~random 
numbers are described in Section 2, 

The second method of obtaining random numbers consists in the 
use of a special computer accessory, namely a random-number 
generator, which transforms the results of a physical random 
process intoa sequence of binary digits inthe computer, i.e., gene- 
rates a random quantity. Usually, the register wherethis random 
quantity is located has an address in the general system of addresses 
of the computer. In that case, accesstothe random-~number gene- 
rator is obtained simply by referring to the computer memory. 

The use of a random-number generator increases the speed of 
computation, since at each step of operation ofthe computer a new 
random number is written in a fixed standard location of the 
memory. 

A disadvantage of such a method is a certain instability of the 
random-number generator owing to which it needs periodic check- 
ing. Another disadvantage is the impossibility of exactly reproduc- 
ing the results of computation of a problem (computations cannot 
be checked by performing them twice). 

It is usually reckoned that if the Monte Carlo method is em- 
ployed systematically on a given computer, thenthe presence of a 
checked-up random-number generator is tobe preferred to the use 
of pseudo-random numbers, If, however, the Monte Carlo method is 
employed on a computer only occasionally, then the use of pro- 
gramming methods is more advantageous, since the operation of 
random-number generators and theirmaintenance require consid- 
erable expenditure of labor. The design and operation of random- 
number generators are described in Section 4, 

A third, comparatively rarely used, methodconsists ininsert- 
ing tables of uniformly distributed random numbers into the 
working memory of the computer. 
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The use of a part of the working memory of a computer for 
storing tables of random numbers is usually impossible since the 
capacity of the computer memory is already utilized for storing 
information of various types directly related to the process to be 
simulated, As to the insertion of uniformly distributed random 
numbers into the working memory of the computer by means of 
drums or tape, the longer access times of such low-speed memory 
devices substantially reduce the speed of operation of the com- 
puter. In addition to this, in order to solve large problems, 
hundreds of thousands or even millions of random numbers are 
often required, and this exceeds by many times the volume of 
tables of uniformly distributed random numbers available today. 

All that has been said above leads to the conclusion that the 
method of inserting uniformly distributed random numbers in the 
computer memory can be of only auxiliary value, This method is 
usually employed in the solution of small problems. 


2. OBTAINING UNIFORM PSEUDO-RANDOM 
NUMBERS ON COMPUTERS 


It has already been pointed out above that by ‘‘sequences of 
pseudo-random numbers’’ we mean sequences of values a, of func- 
tions of an argument i (aninteger) thatresemble, in their proper- 
ties, sequences of random numbers and satisfy a determinate 
system of accepted statistical criteria. Pseudo-random uniformly 
distributed numbers are generated on computers by means of spe- 
cial programs. The designing of a program to compute uniformly 
distributed pseudo-random numbers must take into account the 
following basic requirements. 

a) The program must generate numbers having very little sta- 
tistical correlation. A sample set of pseudo-random numbers 
generated by the program must satisfy prescribed criteria of 
‘trandomness.’’ 

b) The distribution of pseudo-random numbers generated by the 
program must approximate as closely as possible to a uniform 
distribution. 

c) The program must be stable. This means that the distribu- 
tion of the random numbers generated must not vary during the 
operation of the program. 

Let us consider some of the methods available for obtaining 
uniformly distributed pseudo-random numbers. 


1. Analytical Methods of Obtaining Pseudo-Random Numbers 


a) A whole series of methods of obtaining uniformly distributed 
pseudo-random numbers is based on selecting the central digits 


| 
| 
| 
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of a product. This method, which was firstly proposed in 1951 by 
J. von Neumann [223], consists in the following. An arbitrary num- 
ber a, consisting of 22 binary digits is selected to start a recurrent 
process. The quantity «, issquared. The quantity a? consists of 4k 
digits from which a number a, is chosen, consisting of the 2k 
central binary digits [from the (k-+ 1)th to the (3k)th]), Then the 
process is repeated starting from z,, etc. 

Such a type of recurrent process does not give a satisfactory 
sequence of random digits, and the distribution of the pseudo- 
random numbers obtained by this method differs from a uniform 
distribution. 

Considerably better results are obtained by a modified version 
of von Neumann’s method, in whicha pair of numbers, for example 
a anda,, is arbitrarily chosen. Their product «x, is formed and 
its central digits are used for the number a,, The process is re- 
peated for a, and atogives,, etc. This type of recurrent process 
gives pseudo-random numbers with a distribution closer to the 
uniform distribution than does von Neumann’s ‘‘mid~square’’ 
method in its original version. 

A series of similar methods for obtaining uniform pseudo- 
random numbers has been suggested by Lehmer 1189]. They consist 
in the following. Firstly an initial number «a is chosen consisting 
of 2k figures (of 2k binary digits on an electronic computer). The 
product «2 is formed and the last 2 figures of this product are 
chosen, yielding a new number «)}. Thenumber a, is multiplied by 
a constant factor C and the first 2e figures of this product are 
selected to give a number «{. The first 2k figures of the product 


aa 


a2 are then selected and the number a;’is formed. The number a) 


“ 


is multiplied by C and the last 2k figures of the product Ca,’ are 
selected to give the number al”, Digit-by-digit (or ordinary) addi~ 


tion of the numbers a; and aj’ gives the next pseudo-random num- 
ber a. 

The methods described above (also termed truncation methods 
of pseudo-random numbers) give in the end a periodic sequence 
with period not exceeding 2”, 

b) A series of methods of obtaining pseudo-random numbers is 
based on the use of residues, On the ENIAC computer* Lehmer 
[189] has used the recurrence relation 


On41 == Ra, (mod M), 
where k= 23, M=108-+-1. 


*The ENIAC computer uses the decimal number system, 
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Such a method enables us to obtain a sequence of eight-figure 
decimal numbers with period 5882352.* 

In the paper [251], pseudo-random numbers uniformly dis- 
tributed over the interval (0, 1) have been generated by using the 
following recurrence relation 


4741 2-%5,, 
By41= 5178; (mod 2”), By=1. 


Such a set has a period equal to 2 ~ 10?2, 
The set of pseudo-random numbers generated by means of the 
recurrence relation [247] 


F441 28, 
Bj41=5"%, (mod 2), fy==1, 


has period 2%—~ 2x10", while the set obtainedfromthe recurrence 
relation 


41 10-8. 
Pj41= 78, (mod 10"), By= 1, 


has period 5107, 

The first method is used on the SVAC computer and the second 
on the OARAC computer. 

On the UNIVAC electronic computer a recurrence relation of 
the following form [212] has been used 


y= 1, Oy = 1078, 
Basi=7"*'8, (mod 10%), 


The resulting sequence of pseudo-random numbers has period 
5 108, 

A number of methods for generating uniformly distributed 
pseudo-random numbers have been developed, using a Fibonacci 
type series. For example, good results have been obtained by 
using the sequence 


*The length of a period has been calculated by an empirical method, 


2, OBTAINING UNIFORM PSEUDO—RANDOM NUMBERS ON COMPUTERS 201 


Fy==0, F,=1, 


On+2= Faia 
where 
Paso= [Pasi t F,] (mod M). 


For the SEAC computer, M= 2 has been used. The period thus 
obtained was approximately equal to 2.510", 

Van Weingarden [251] has suggested a modified version of the 
last-mentioned method. A group of numbers ap, a, a,..., a, is 
arbitrarily selected, andalinear combination of them with integral 
coefficients is formed, Its fractional part is thenumber a,,,, The 
process is repeated on the set 4, a, ..., @» @,4,, etc. If the num- 
bers a, ..., @, are uniformly distributed over the interval (0, 1), 
then, from the elementary theorem stating that the sum of two 
uniformly distributed quantities, taken modulo 1, is uniformly 
distributed, it follows that «,,, is also uniformly distributed. It 
can easily be shown for p=1 that any two quantities of this se- 
quence are independent. This cannot be said of triplets, since 
they are found in certain cases to be linearly dependent. 

With such a scheme we obtain longer periods, since, before 
repetition begins, the whole set of p-+-1 numbers must again as- 
sume its initial values. 

We observe that in obtaining pseudo-random uniform numbers 
generated according to the formula 


j-1 


On 47 = 2 C,2,,, (mod M), 


the choice of the initial values «,, a, ...,% 2 plays an important 
role. Thus, with a suitable choice of the first two initial quantities 
%, &, ithas been possible to obtain ona SEAC computer a sequence 
of pseudo-random numbers satisfying a system of various statisti- 
cal criteria [105]. Pseudo-random numbers are generated accord- 
ing to the formula 


454.1 == [4;-+ 4,_] (mod 4), 
where 


uy=t, a,=5".2-”, 


c) Several procedures for obtaining uniformly distributed 
pseudo-random numbers are based on Lehmer’s method [251]. 


202 Vl. GENERATING UNIFORMLY DISTRIBUTED RANDOM QUANTITIES 


The latter relies on the known fact that for an irrational number 
§ the set of numbers of the type {n§}(where {x} denotes the frac- 
tional part of the quantity x) comprises numbers arbitrarily close 
to any number in the interval (0, 1), since they are everywhere 
dense in (0, 1). Lehmer’s method reduces to constructing a se- 
quence a, = {n6} of nonrandom correlated numbers the distribution 
of which (in the number-theoretical meaning ofthis word) is rigor- 
ously uniform, This method is used with advantage in those cases 
where the main interest is the uniformity ofthe distribution while 
the presence of correlation is of little importance. 

Various schemes based on number-theoretical methods have 
been suggested for generating pseudo-random sequences by N. M, 
Korobov and I. I. Shapiro-Pyatetskii [39]. 


II, Obtaining Uniform Pseudo~Random Numbers 
by the Mixing Method. 


For obtaining uniform pseudo-random numbers on the elec- 
tronic computers ‘‘Strela,’’ BESM, ‘‘Ural,’’ etc., several methods 


exist that utilize specific features of these machines. These | 


methods are based on simulating the random, chaotic mixing of the 
digits of the mantissa of pseudo-random numbers. Among these 
methods the most fully developed are those for ‘‘Strela’’ type 
electronic computers. We shall cite, astypical ofsuch programs, 
those for generating pseudo-random numbers for the ‘‘Strela’’ 
computer developed by I. M. Sobol’ and D. I. Golenko. Both these 
recurrent methods are realized by means of a three-instruction 
program, 

In the program developed by I. M. Sobol’ 57], the number Opay 
is obtained from the number «a, by three operations: 

1) the number «a, is multiplied by 101’; 

2) the number representing the product 10!’ a, (the ‘‘image’’ of 
107 «,) is shifted seven places to the left (so that zeros will be 
found at places from 36 to 42); 

3) the absolute value of the number obtained is taken (and at 
the same time the number is normalized), The result is a,,,. 

We observe that the properties of the sequence {a,} depend not 
only on the value assigned for o, but also on the method of rounding 
off used in the computer. 

Moreover, the number 10!’ and the number for determining the 
value of the shift are placed inthe unit for issuing the constants of 
the computer. Thus, only one location of the working memory is 
required to compute a,. 

Finally, the risk of degeneracy of the sequence, that is the 
probability of some «, reducing to zero, is extremely small in 
practice. In fact, for a, to reduce to zero it is necessary that 
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0.5 <a,_,-10% <1; 


otherwise, nonzero digits will occur in the exponent of the prod- 
uct a,_,X10"%, and after the subsequent shift these digits will be 
found in the mantissa of «,, Thus, inorder that the sequence may 
degenerate, a,_, must belong to a very narrow interval of values. 

Several tens of values of a have been used in the solution of 
various problems, but no cases of degeneracy of the sequence 
have occurred. 

The three-instruction program developed by D. I. Golenko [24] 
has also been repeatedly used in the solution of a series of prob- 
lems by the Monte Carlo method. 

Pseudo-random numbers uniformly distributed over the inter- 
val (0, 1) are generated by three single-step computer operations, 
An initial assigned value 4) 0 (which can be also random) is 
placed to begin with in the location a, in which the subsequent 
numbers of the sequence are to be generated. 

Pseudo-random numbers are calculated by recurrence; the 
number «,,, is generated from a, according to the following pro- 
gram, 

1. The image of the number «, in the location a is shifted 
seven places to the left (the places 36to 42 are ‘‘cleared’’) and is 
placed in the location 8. 

2. The special addition operation* (the operation 02) is carried 
out on the contents of the locations a and 4, the first number added 
being the content of the location 6. Asa result, zeros arise in the 
places from 36 to 42, while the content ofthe places of the mantissa 
of the locations @ and bis chaotically mixed. The result is written 
in the location a, 

3. The absolute value of the content of the location a is evalu- 
ated (with subsequent normalization). The result is written again 
in the location @ (the operation 04). This result is the number a,,,. 

In the instruction~code language of the ‘‘Strela’’ computer this 
program will be written thus 


Operation ist 2nd 3rd 
code address address address 
1 14 a 4007 b 
2 02 b a a 
3 04 a — a 


*The special=-addition instruction accomplishes the logical digiteby-digit addi- 
tion ofthe contents of the first and second addresses, the exponent of the first number 
added being retained, The first is written in the third address, 
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Thus two locations of the accumulator, a main one (a) and an 
auxiliary one (b), are needed for obtaining pseudo-random num- 
bers. In the intervals between the generation of pseudo-random 
numbers the location 6 may be used as a working location. 

Just as for the pseudo-random number program described 
above, the probability of degeneracy of the number sequence 
generated is very smallin practice. Degeneracy occurs only in the 
case when the content of the location 4 after the shift is the digit- 
by-digit complement of the content of the location a. Only in this 
case does the special addition operation result in a zero in the 
location a. Cases of degeneracy of the sequences have never been 
observed, in spite of the fact that pseudo-random numbers have 
been used for several years in a large number of problems with 
various a. 

The first of the programs described above has an aperiodicity 
stretch (see Section 3) of the order of 80,000, the second program 
one of the order of 300,000. The length of the cyclicity period 
generated amounts to 50,000 numbers for thefirst program and to 
about 100,000 numbers for the second one. 

The program for obtaining on the BESM computer pseudo- 
random numbers uniformly distributed over the interval (0, 1) 
comprises nearly the same machine operations as the correspond- 
ing program for the ‘‘Strela’’ computer. Thefollowingtable gives 
the four-instruction program for the BESM computer [50] 


Ordinal number Operation Address Operation content 
of instruction code 
I r Ww 
k 5% e@ 0047 a+1 7-digit right shift 
k+1 so @ 0007 2 +2 7T-digit left shift 
k+2 SA a+la+il1 a+2 Special addition 
k+3 |TM| a+2 0000 e« # £=Transfer of modulus 
of number with 
normalization 


This program differs from the corresponding program for the 
‘Strela’? computer owing to the different computer-operation codes 
for the ‘‘Strela’’ and the BESM. Thus the second shift in the pro- 
gram for the BESM computer is necessary since without it there 
occurs degeneracy of the pseudo-random number program with 
consequent ‘‘zero output.’’ \ 

The length of the aperiodicity stretchfor this program does not 
exceed 50,000. As to the length of the period of the sequence of 
pseudo-random numbers, this is close to.5000 for several initial 
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values of a. In the process of solving certain problems by the 
Monte Carlo method, several modifications have been introduced 
in the program for the BESM computer: the number of positions 
shifted was increased to 8, etc. This has increased the length of 
the aperiodicity stretch to 100,000-150,000, but the quality of the 
pseudo-random numbers thus generated has appreciably deteri- 
orated [50]. 

As to the single-address electronic computer ‘‘Ural-1,’’ the 
program for obtaining on it pseudo-random numbers uniformly 
distributed over the interval (0, 1) is more cumbersome than the 
programs described above. 

Provided they are taken in sufficiently long sequences, the 
pseudo-random numbers obtained by means of the above-described 
programs usually satisfy the system of statistical criteria adopted 
for testing the randomness of the distribution. These criteria will 
be described in detail in Section3. However, the use of statistical 
goodness-of-fit criteria does not solve the problem of a thorough 
assessment of the distribution of pseudo-random numbers. The 
fact that a statistical criterion is obeyed merely means that we 
have no grounds for rejecting the hypothesis of uniformity of the 
distribution of the pseudo-random numbers. This, however, does 
not by any means guarantee that this hypothesis is the correct one. 

In addition to this, in a series of statistical goodness-of-fit 
criteria (for example the ;? test) the closeness of the empirical 
distribution to a uniform distribution may have to be variously 
assessed depending on the sample size n. Therefore, for one and 
the same empirical distribution we may accept the hypothesis of a 
uniform distribution for one value of n and reject it for another 
value of n. 

What has been said above does not detractfrom the importance 
and usefulness of the given statistical goodness-of-fit criteria 
for assessing the empirical distribution of a sequence of pseudo- 
random numbers. Goodness-of-fit criteria are a very effective 
method of analysis of the quality of pseudo-randomnumbers. This 
is especially true in those cases inwhichnot one, but a system of 
statistical criteria supplementing and reinforcing each other is 
used. We shall merely remark that ina number of concrete cases, 
just as with other methods of statistical analysis, the use of 
goodness-of-fit criteria can be rather inefficient or altogether too 
difficult to carry out. 

The degree of closeness of an empirical distribution of pseudo- 
random numbers to the theoretical uniform distribution may also 
be investigated without the use of statistical goodness-of-fit 
criteria, if we analyze the distribution of the separate digits of 
which the pseudo-random numbers consist. 
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We shall assume below that the pseudo-random numbers in 
question have been obtained by means of the programs described 
above from an initial random number a. Otherwise we would not 
be entitled to apply to pseudo-random numbers such probability- 
theoretical concepts as randomness, probability, distribution, 
etc. 

Let us consider the distribution of the values of separate digits 
for several pseudo-random numbers generated from each other 
(i.e., obtained sequentially). If a pseudo-random number obtained 
is represented in the form of a nonnormalized a-digit binary 
number with zero exponent, then, in thecaseof uniform distribu- 
tion, the probability of occurrence of a zero (or one) at an arbitrary 
place of the mantissa is equalto0.5. In reality, for each position, 
this probability is somewhat different from 0.5. 

Suppose a sequence of pseudo-random numbers is generated 
and let the symbol ¢{" denote the ith digit in the eth calculated 
pseudo-random number, «,. Then the probability of occurrence of 
a zero in ¢\"’ may be written in the form 0.5(1 +2). 

Let us consider the connection between the distribution of the 
values of the separate digits of nonnormalized pseudo-random 
numbers and the probability distribution of pseudo-random num- 
bers themselves. Let us introduce the notation 


p= max P {:” —0}=0.5(1+ dmax) 
lgi<a 


and let us estimate the relative deviation Aof the distribution of a 
Sequence of pseudo-random numbers (in which all digits of the 
mantissa have a ‘‘probability distortion,’’ equal to 8,.x) from the 
uniform distribution. We shall disregard the correlation between 
digits and all digits of the mantissa will be assumed independent. 
Denoting by m the number of zeros in a pseudo-random number, 
we can easily convince ourselves that 


A= (i+ Bmax)” (ai— Bmax)?” —1=(m— n) Bmax + oO max)» 


Therefore, for m~ 5 


A~O, 
and form=—0 orm=—naA 


|A| ~ n8max- 
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If we fixthe number A,(1 > 4) > 0)and set ourselves the task of 
finding 8n.x for which |A| <A, then 


Bmax < ze} * 


We have thus shown that for any preassigned A, there exists a 
value naxx such that if ’<8,,,; for all digits of the mantissa of 
pseudo-numbers, then|A| < A, forthe random numbers themselves. 
It remains to establish the value 4,,. for digits of pseudo-random 
numbers obtained by means of the programs describedabove, and 
thus to guarantee a relative departurefrom the uniform distribution 
not exceeding the value ni,,,,. 

It will be recalled that the last instruction in all three programs 
described above for obtaining uniformly distributed pseudo- 
random numbers (for the ‘‘Strela’’ and BESM computers) is the 
instruction for taking the modulus with subsequent normalization. 
Since the investigation of the distribution ofthe digits of a pseudo- 
random number a,,, after its normalization is made very difficult 
by the deterministic nature of individual digits (the value of the 
first digit, for example, is always one, and that of the last is al- 
ways zero), it is expedient to assess the value of 5,, for digits of 
the mantissa of the pseudo-random number before its normaliza- 
tion. 

As has already been indicated above, the initialnumber «, from 
which the sequence of pseudo-random numbers is generated (this 
initial number is not used as a pseudo-random number) is taken 
to be a random number with independent, random and uniformly 
distributed digits.* We can easily convince ourselves that 8,;,—0 
for the first pseudo-random number «a,. The calculation of the 
value of 8m,:x for a, (k>>2) isconsiderably more complicated. Al- 
though all calculations have been carried out only for the second 
of the above-described three-instruction programs for generating 
pseudo-random numbers on the ‘‘Strela’’ computer, the procedure 
suggested for calculating 3,,, may be employed also for other 
methods of obtaining pseudo-random numbers. 

Calculations carried out have shown that, for digits from the 
first to the twentieth, the formula for computing the probabilities 
for 1<k< 15** has the following approximate form 


|P(e—o0}—0.5|< 0.0001 (1<i< 20). 


*The number of digits is equal tothe number of digits of the corresponding elec- 
tronic computer, 
**No calculations have been carried out for# > 15. 
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In other words, for the first twenty digits 6g., < 0.0001, and for 
n= 20 we have A < 0.002. 

It is important to note the following. The calculations carried 
out guarantee the uniformity of the distribution only of the first 
twenty (out of 35) digits of the mantissa of a pseudo-random 
number. This means that the probability distribution of the whole 
pseudo-random number may have periodic departures from uni- 
formity with period not exceeding 2~”. 

Thus an analysis of the distribution of the values of the digits 
of several sequentially obtained pseudo-random numbers guaran- 
tees uniformity to 0.5% accuracy over the interval(0, 1) for at least 
15 sequentially obtained pseudo-random numbers, 

If the solution of a problem by the Monte Carlo method utilizes 
a sequence of pseudo-random numbers which have been given a 
preliminary check by means of a system of statistical tests, it is 
expedient to use just one program of uniform pseudo-random num- 
bers without any improvement whatsoever. Otherwise a random 
‘“‘perturbation’’ of the digits of the pseudo-random number must 
be carried out periodically every 10-15 sequentially generated 
pseudo-random numbers. 

Good results are obtained, for example, by using at the same 
time two programs in the following manner: at first an initial num- 
ber a, is chosen andis written inthe location a, and the generation 
of pseudo-random numbers begins according to thefirst program. 
After 10-15 pseudo-random numbers have been generated, we 
transfer momentarily to the second program, which generates, 
from the last pseudo-random number (the content ofthe location a), 
a new pseudo-random number whichis usedto start again the first 
program, etc. Such a use of two programs enables the length of 
the period to be greatly increased.* 

We proceed now to a description of a system of statistical 
criteria to test the quality of pseudo-random numbers. 


3. CRITERIA TO TEST THE QUALITY OF UNIFORM 
PSEUDO-RANDOM NUMBERS 


a) Criterion for Testing the Periodicity of a Sequence 
of Pseudo-Random Numbers 


A sequence of pseudo-random numbers obtained by means of a 
program has the following properties. The first L sequentially 


*If "perturbation" is carried out every m numbers, the average length of the 
period increases, as has been shown by I, M, Sobol’, by ym times. 
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obtained pseudo-random numbers, beginning with a,, will be all 
different, but the (L+ 1)th pseudo-random number 2a,,,., coincides 
with one of the numbers previously obtained, say, «, (l<i<J) 
(Fig. 29). Further on, the subsequence of pseudo-random num- 
bers beginning with a, and ending with «, will be periodically 
repeated. The number Z is referred to asthe length of the aperi- 
odicity stretch, and the number (L—i-+1) as the length of the 
period. Of course the length of the aperiodicity stretch and the 
length of the period will be different for various values of the 
initial quantity ,. 


Len f 
aperiodicity stretch 


a &e in Fj igh Gy Ae aL-| 23L-i 


Ordinal number 
Length of period = Length Length of period of number 
of 


period 


Fig. 29. Representation of sequence 
of pseudo-random numbers. 


It must be pointed out that, in solving problems by the Monte 
Carlo method on computers, it is desirable that the number of 
pseudo-random numbers used be not larger than L. Otherwise the 
probability process would be simulated by means of repeating 
pseudo-random numbers, which leads as a rule to incorrect re- 
sults. 

Let us consider the problem of determining the length of the 
aperiodicity stretch. 

In an electronic computer, only a finite number N of different 
pseudo-random numbers can be represented. For pseudo-random 
numbers distributed over the segment (0, 1)wehave N=2', where 
1 is the number of digits in the mantissa of the computer. 

Suppose we had, instead of N pseudo-random numbers, N 
ordinary random numbers with equal probability of occurrence for 
any of them. In this case the length of the aperiodicity stretch L 
would be a random quantity the distribution of which would be as- 
signed by the probabilities 


Nix 


P {L= x} = Wont 


(x= 1, 2,..., N). 


Let us consider the asymptotic distribution of the random 
quantity 1=4 for N->oco. It may be shown that 
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| 
to| 8 
. 


jim PL ap <2}=1—e 


The latter means that the random quantity 4 one is asymptotically 
distributed as y? with two degrees of freedom (see Section 3c). 


Let ai, ..., a be various initial values with which we begin 
to generate a sequence of pseudo-random numbers, and let 
L,. ..., L, be the corresponding lengths ofthe aperiodicity stretch. 


By employing similar reasoning, it may be shownthat the random 
quantity 


m 
Dui 
1H 


is distributed asymptotically as y? with 2m degrees of freedom. 
The last fact enables us to use the x? criterion for assessing em- 
pirical values of the length of the aperiodicity stretch. 

Let us cite, for example, data obtained by generating pseudo- 
random numbers on the ‘‘Strela’’ computer by means of the pro- 
gram described on page 203. Calculations gavethe following values 
of the aperiodicity stretch 


L,=2.3-105, 1,=3.5-105, L,=2.8 + 105, 
In the *‘Strela’?’N = 2%, Hence 


Li+ 3+ 3 


NW = 8.7. 


In our example m=3, which corresponds to ay? distribution with 
six degrees of freedom. The value 8.7 does not exceed the 5% 
confidence limits for y2 (see Section 3c), Thus the experimental 
data of the example quoted indicate that, as far as the periodicity 
parameters are concerned, pseudo-random numbers simulate 
random numbers well. 


b) Criteria for Testing Randomness 


A system of tests designed to check the ‘‘randomness’”’ of a 
distribution has been developed by Kendall and Babington- 
Smith, The system consists of four tests: the frequency test, 
the serial test, the gap test and the poker test. 

A set of pseudo-random digits satisfying all these tests is 
called locally random ((157, 179)). 
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All the above-mentioned tests are characterized by one and the 
same general property: the pseudo-random numbers (or digits in 
them) subjected to atest areclassified according to certain indices 
(different for each test), and the empirical distribution obtained is 
compared with the theoretical one. They?criterion, Kolmogorov’s 
criterion and the w? criterion are usually employed for the com- 
parison. 

Here is a description of Kendall’s system of tests. 

The frequency test involves counting the number of pseudo- 
random numbers that are found in subintervals of the range of 
definition of the pseudo-random numbers, i.e., the segment(0, 1). 
Usually the segment (0, 1) is subdivided into 10 to 20 equal inter- 
vals, 

The frequency test is successfully employed also inthe case of 
a multidimensional distribution, In this case we subdivide the 
multidimensional cube 0<¢x,<1, 1<i<ninto elementary small 
cubes with equal surfaces, andthen we count the number of pseudo- 
random numbers that are found in each of the small cubes from a 
sample of a sufficiently large number of pseudo-random numbers. 
In practice, the size ofthe set of pseudo-random numbers investi- 
gated is rarely smaller than 10,000 andis usually 50,000-100,000. 
Of course, in investigating uniform pseudo-random numbers, the 
theoretical distribution to be compared with the empirical distri- 
bution is uniform, and the expected number of pseudo-random 
numbers in any interval is equal to Nmes (V;), where Nis the 
volume of the sampling set, and mes (V,) is the length of the sub- 
interval (or the volume of the elementary small cube). 

The serial test is the simplest one and involves counting the 
zeros or ones in the digits.of the set of pseudo-random numbers 
for which the test is carried out. In the case of pseudo-random 
numbers uniformly distributed over the interval (0, 1) the ex- 
pectation of the content of each place ofthe mantissa of a pseudo- 
random number is equal to '/,, since the probability of the oc- 
currence of a zero or one is equal to 0.5.* 

The ‘‘poker’’ test involves counting, in a sample set of large 
size, the number of pseudo-random numbers containing various 
combinations of binary digits. For example, let 14,000 pseudo- 
random numbers each containing 10 binary digits be subjected to 
testing. Table 11 shows an empirical distribution ofthe frequency 
of various combinations of binary digits in comparison with the 
theoretical uniform distribution.** 


*This applies to nonnormalized pseudo-random numbers with zero exponent, 
**Mathematical Tables and Other Aids to Cornputations, Vol, 101, No, 53 (1956), 
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Table 11 
Data Theoretically 
of tests calculated data 
10 zeros 12 13.67 
9 zeros 1 one 149 136.72 
8 zeros 2 ones 607 615.23 
7 zeros 3 ones 1662 1640.63 y? =7.372 
6 zeros 4 ones 2922 2871.09 
5 zeros 5 ones 3468 3445.31 P =0.69 
4 zeros 6 ones 2763 2871.09 
3 zeros 7 ones 1633 1640.63 
2zeros 8 ones 627 615.23 
1 zero 9 ones 143 136.72 
10 ones 14 13.67 


The gap test is usually replaced by a frequency test of the 
length of runs [33]. 

This latter method applies to the case in which all pseudo- 
random numbers in the set investigated are divided into two 
classes: class a and class 6b. 

A run is a section of the sequence consisting of numbers of the 
same Glass following each other. The number ofnumbers in a run 
is called its length. 

In the theory of runs the following concepts are usually em- 
ployed for testing ‘‘randomness’’: 


r;,—the number of runs of class a of length i, 
r,,—the number of runs of class b of length i, 


Riz =3r —the total number of runs of class a of length equal 
to or larger than & (n, is themaximum length of a run of class a), 

Rog =3ru —the total number of runs of class 6 the length of 
which is not smaller than k, 


fy 
R,= > r,,—the total number of runs of class a, 
R=1 


wy 
R, = rp, —the total number of runs of class 5, 
k=1 


R=R,+ R,-the total number of runs, 
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In checking ‘‘randomness’’ by the method of runs we must 
compare the theoretical and the empirical distributions of the 
quantities Rip, Ry», Ry; R. and R in a large sample of pseudo- 
random numbers, In this comparison the confidence probability 
p must be assigned, 

To investigate pseudo-random numbers uniformly distributed 
over the interval (0, 1), the subdivision of the numbers into two 
classes is usually done in the following manner. 

To the first class a are attributed numbers smaller than 0.5, 
to the second class 6 numbers larger than 0.5. The sampling 
values of the number of runs and of the length of runs of numbers 
of the first and secondclasses are compared with their theoretical 
limits determined according to the following formulas: for suffi- 
ciently large x (practically for 2> 20) and p=0.95 the lower limit 
of the total number of runs is equal to 


z=5(n-+1—1.65 Va—1) 


(e.g., for »=10,000 z equals 4918); the lower limit of the number 
of runs of numbers of the first (or second) class is equal to 


2(R)=2(R)= > (n— 1.65 Vn 1) 


(e.g., for n= 10,000 2(R)) = z (R,) = 2459). 

As to the distributions of the quantities R,, and R,,and the de- 
termination of the limits of the length of runs, &, it can be shown 
by methods of combinatorial analysis that the upper limit of the 
length of runs is 


e(-te) 
lg 2 


2(k)= 
(for n= 10,000 and p—0.95 we have z(k)=15). 

This means that a run of length 15 or larger (for each of the 
classes) may be expected to occur ina random sequence of 10,000 
random numbers in only 5% of the cases. 

Below we describe criteria used in comparing empirical with 
theoretical distributions, 


c) Criteria for Testing Uniformity 
A sequence of pseudo-random numbers is usually tested for 


‘‘uniformity’’ on the basis of three criteria: the y?, Kolmogorov’s 
and the w? criterion. 


214 VI, GENERATING UNIFORMLY DISTRIBUTED RANDOM QUANTITIES 


The basic, most often used criterion is the y? test. 
The y? test is based on the statistic 


l 
(vy — np,)? 
2 — ees, 
x= » np, 


where vy, is the sample number of objects in the /th interval, and 
np, is the expectation of the quantity ¥, in the hypothetical distri- 
bution (in our case the hypothetical distribution is the uniform 
distribution over the segment (0, 1). 

If the segment (0, 1) is subdivided into / equal intervals, and 
if, having taken a sample of sufficiently large size n, we evaluate 
the sum 


t 
+y (lv, — n)? (= ees =?1=+): 


i=l 


then in the case of uniformity of the theoretical distribution this 
sum will approximately follow the y? distribution with (£— 1) de- 
grees of freedom. 

The hypothesis of uniform distribution of the pseudo-random 
numbers must be rejected if y? exceeds the upper limit ;3_,(p) of 
the confidence interval, where p isthe assigned confidence proba~ 
bility and — 1 is the number ofdegrees of freedom, By confidence 
probability » we mean a probability close to 1 such that, if the 
hypothesis of uniform distribution is correct, the probability is p 
that the value obtained for y? will not exceed y?(p). If however the 
confidence limit is exceeded, this means that the measure of dis- 
crepancy, y*, indicates a significant departure, andthe hypothesis 
of uniform distribution must be rejected. 

Extremely small values of y? must be considered an indication 
of failure of randomness, since in the case of correctness of the 
hypothesis the probability for a random quantity to assume too 
small values is extremely small. Therefore, the critical region 
of ‘‘nonacceptance of the hypothesis’’ is conveniently taken to 
have two parts. A lower limit fory? may be established as follows: 


P (x? > 12_,(—)} =P (2 <x — py} 


where y?_,(1— p) denotes the lower confidence limit. 

The fact that the value of y? is found once in the interval 
[<7-11—p) xj_, (p)] does not make it certain that the hypothesis is 
correct, just as the fact that y? exceeds once the limits of the 
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interval (especially for p—0.8—0.9) does not make it certain that 
the hypothesis is false. In order to be certain of our judgment 
as to the hypothesis, we have to carry out the calculation of x? 
several times and examine how closely the empirical distribution 
of the quantity agrees with the theoretical one. Thus, in the case 
of p=0.95, the probability of lying once beyond the limits of the 
interval is small but far from negligible. In the case of correct- 
ness of the hypothesis, the value may be beyond the limits of the 
interval, or the average, inone case out of ten. However, if sam- 
pling is carried out twice and the values lie beyond the limits of 
the interval in both cases, then this event has probability 0.01 and 
must be considered practically impossible. Results of tests of 
pseudo-random numbers using the y? criterion are shown in Table 
13 on page 219, 

If in multiple testing using Pearson’s criterion the quantity y? 
does not exceed the value y}_,(p), but each time differs from it 
little, then this is evidence of the needfor carrying out additional 
investigations owing to the inadequate agreement of the hypothetical 
and the empirical distributions. 

A very effective measure of goodness-of-fit is also Kolmo- 
gorov’s criterion which is based on the statistic 


D, = max | F, (x) — F (x)|. 


where F,,(x) is the empirical distribution function, which is equal 
m 
7 : 
in the sample not exceeding the value x), while F(x) =P{X < x} 
is the theoretical distribution function [33]. It can be shown, for 
an arbitrary continuous distribution function F(x), that 


to p=—*(n is the size ofthe sample, m, is the number of objects 


h 
Ps <7] 
tends for n-—>co to the limit (see [33]) 


KQ=1—2 > ‘— 1"? ene 
v=} 


Values of the function 1—X()) are shown in Table 12. 

The goodness-of-fit of the empirical distribution of pseudo- 
random numbers with the theoretical uniform distribution over the 
interval (0, 1) isusually tested witha sample size n=~1000, The use 
of samples of larger size is extremely cumbersome, since the 
generation of the variational series (i.e., the series of differences 
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Table 12 


+00 
Table of values of the function 1—K(a)=  (—1)*e~” 


»~ 


1-K(A) r 1-K(A) r 1-K(A) 


0.000001 0.67 0.239582 1.06 0.788860 
0.000004 0.68 0.255780 1,07 0.797636 
0.000009 0.69 0.272189 1.08 0.806128 
0.000021 0,70 0.288765 1.09 0.814342 
0.000046 0.71 0.305471 1,10 0.822282 
0.000091 0.72 0.322265 1.11 0.829950 
0.000171 0.73 0.339113 1.12 0.837356 
0.000303 0.74 0.355981 1.13 0.844502 
0.000511 0.75 0.372833 1.14 0.851394 
0.000826 0.76 0.389640 1.15 0.858038 
0.001285 0.77 0.406372 1.16 0.864442 
0.001929 0.78 0.423002 1.17 0.870612 
0.002808 0.79 0.439505 1.18 0.876548 
0.003972 0.80 0.455857 1.19 0.882258 
0.005476 0.81 0.472041 1.20 0.887750 
0.007377 0.82 0.488030 1.21 0.893030 
0.009730 0.83 0.503808 1,22 0.898104 
0.012590 0.84 0.519366 1.23 0.902972 
0.016005 0.85 0.534682 1,24 0.907648 
0.020022 0.86 0.549744 1,25 0.912132 
0.024682 0.87 0.564546 1.26 0.916432 
0.030017 0.88 0.579070 1.27 0.920556 
0.036055 0.89 0.593316 1.28 0.924505 
0.042814 0.90 0.607270 1.29 0.928288 
0.050306 0.91 0.620928 1.30 0.931908 
0.058534 0.92 0.634286 1.31 0.935370 
0.067497 0.93 0.647338 1.32 0.938682 
0.077183 0.94 0.660082 1.33 0.941848 
0.087577 0.95 0.672516 1,34 0.944872 
0.098656 0.96 0.684636 1.35 0.947756 
0.110395 0.97 0.696444 1.36 0.950512 
0.122760 0.98 0.707940 1.37 0.953142 
0.135718 0,99 0.719126 1,38 0.955650 
0.149229 1.00 0.730000 1.39 0.958040 
0.163225 1.01 0.740566 1.40 0.960318 
0.177753 1.02 0.750826 1.41 0.962486 
0.192677 1.03 0.760780 1.42 0.964552 
0.207987 1.04 0.770434 1.43 0.966516 
0.223636 1,05 0.779794 1.44 0.968382 
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1.45 
1.46 
1.47 
1.48 
1.49 
1.50 
1.51 
1.52 
1.53 
1.54 
1.55 
1.56 
1.57 
1.58 
1,59 
1.60 
1.61 
1.62 
1.63 
1.64 
1.65 
1.66 
1.67 
1.68 
1.69 
1.70 
1.71 
1.72 
1.73 
1.74 
1.75 
1.76 
1.77 
1.78 
1.79 
1.80 
1.81 


1-K(A) 


0.970158 
0.971846 
0.973448 
0.974970 
0.976412 
0.977782 
0.979080 
0.980310 
0.981476 
0.982578 
0.983622 
0.984610 
0.985544 
0.986426 
0.987260 
0.988048 
0.988791 
0.989492 
0.990154 
0.990777 
0.931364 
0.991917 
0.992438 
0.992928 
0.993389 
0.993828 
0.994230 
0.994612 
0.994972 
0.995309 
0.995625 
0.995922 
0.996200 
0.996460 
0.996704 
0.996932 
0.997146 


Table 12 (continued) 


r 


1,82 
1,83 
1.84 
1.85 
1.86 
1.87 
1.88 
1.89 
1.90 
1.91 
1.92 
1.93 
1.94 
1.95 
1.96 
1.97 
1.98 
1.99 
2.00 
2.01 
2.02 
2.03 
2.04 
2.05 
2.06 
2.07 
2.08 
2.09 
2.10 
2.11 
2.12 
2.13 
2.14 
2.15 
2.16 
2.17 
2.18 


1-K(A) 


0.997346 
0.997533 
0.997707 
0.997870 
0.998023 
0.998145 
0.998297 
0.998421 
0.998536 
0.998644 
0.998744 
0.998837 
0.998924 
0.999004 
0.999079 
0.999149 
0.999213 
0.999273 
0.999329 
0.999380 
0.999428 
0.999474 
0.999516 
0.999562 
0.999588 
0.999620 
0.999650 
0.999680 
0.999705 
0.999723 
0.999750 
0.999770 
0.999790 
0.999806 
0.999822 
0.999838 
0.999852 
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1-K (A) 


0.999864 
0.999874 
0.999886 
0.999896 
0.999904 
0.999912 
0.999920 
0.999926 
0.999934 
0.999940 
0.999944 
0.999949 
0.999954 
0.999958 
0.999962 
0.999965 
0.999968 
0.999970 
0.999973 
0.999976 
0.999978 
0.999980 
0.999982 
0.999984 
0.999986 
0.999987 
0.999988 
0.999989 
0.999990 
0.999991 
0.9999920 
0.9999925 
0.9999956 
0.9999974 
0.9999984 
0.9999990 
0.9999994 


|F,(x)— F(x)|) for n= 1000 is a very difficult operation even when 
high-speed computers are used. For example, the generation 
of the variational series and the evaluation of the quantity 
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nile D,were carried out on the ‘‘Strela’’ computer, the total time 
taken for testing by Kolmogorov’s criterion being 15-20 minutes. 


1 
The hypothesis of ‘‘uniformity’’ was rejected ifthe quantity x {a D, 


exceeded 1.5. The result nl D,<0.5is also very undesirable. The 
existence of an ‘‘extremely good’’ fit would indicate that the ‘‘ran- 
domness’’ of the pseudo-random numbers is questionable, 

The interval (0.7, 1.0) is conveniently taken as the optimum 


range of values of the quantity nile D,. 

Together with the y? and Kolmogorov criteria, the w? criterion 
is also often employed. Its advantage over the x’ criterion is the 
fact that it is based only onthe individual values of the sample and 
not on data based on subdivision into intervals. 

The basis of the w? criterion is the statistic 


+c 


eo— i [F, (x) — F (x) dF (x), 


where F(x) isthe theoretical and F,,(x) isthe empirical distribution 
function (n is the sample size), If we arrange the sampling values 
X;, No, ...,<%, in a variational series, we have, inthe case of a con- 
tinuous function F(x), 


n 
eat tS [Poet 


vel 


The distribution of the random quantity w? does not depend on F(x). 
Since the Kolmogorov and o? criteria require the construction 
of a variational series, high-speed computers are conveniently 


used for evaluating the statistics nla D,,and w? whenever the sample 
size n is fairly large. According to thew? criterion (n= 1000), the 
hypothesis of ‘‘uniformity’? must be rejected if w? > 0.9005, 

Table 13 shows the results of tests of pseudo-random numbers 
using at the same time the three criteria: the y?, the Kolmogorov 
and the w* criteria, The pseudo-random numbers tested were 
distributed uniformly over the interval (0, 1) and were generated 
on the ‘‘Strela’’?’ computer by means of a three-instruction pro- 
gram, Tests were carried out for 10 different initial values of a,. 
The size n of the sample tested was 10,000. 

It can be seen from the table that for all 4) we have 10< x? < 30 
and there are thus no grounds for rejecting the hypothesis. 
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Table 13 
Serial number 
of values of x? Dnvn w* 

Oo 
1 15.4 0.84 0.00039 
2 14.6 0.69 0.00013 
3 20.1 1.10 0.00027 
4 17.2 0.74 0.00009 
5 17.4 0.62 0.00019 
6 24,1 1.04 0.00014 
7 12.8 0.91 0.00007 
8 19.4 0.87 0.00015 
9 16.5 0.79 0.00044 

10 26.7 0.65 0.00011 


It can be noticed, in addition, that the distribution of ten sam- 
ple values of y?follows in general the y? distribution with 19 degrees 
of freedom, 

Thus, with this distribution, the majority (about 50%) of the 
values of the quantity must occur in the interval (15, 25), and this 
is in complete agreement with the experimental data. This may 
serve as a basis for accepting the hypothesis of uniformity of the 
distribution of pseudo-random numbers. 


The distribution of the value of 2/2 D,, 28 shown inthe table agrees 
with the limiting distribution of Kolmogorov’s criterion. The 
majority of values occurs inthe optimum interval (0.7, 1.0), which 
corresponds to the theoretical calculations. The calculations using 
Kolmogorov’s criterion confirm the validity of the hypothesis. 

In checking the ‘‘randomness”’ and the distribution of pseudo- 
random numbers it is expedient to check the correctness of the 
operation of the program as a whole, This can be done by evalu- 
ating, in a control problem, some parameter or other the value 
of which is known a priori. Ofcourse, the calculation of the value 
of the parameter is carried out by the Monte Carlo method using 
pseudo-random numbers, If the calculated result does not depart 
from the a priori value more than the theoretically calculated 
statistical error, this means that the defects ofthe pseudo-random 
numbers do not affect the behavior of the calculations and can be 
accepted. In the opposite case we must modify the program and 
repeat the control computation until a positive result is obtained. 
We can choose asthe control problem the calculation of the volume 
of an n-dimensional hypersphere. As is well known, the volume of 
an n-dimensional hypersphere is equal to 
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qh? 

we 
(5) 
If £, is the hypercube -1<x,<+1(l1<i<n), the ratio of the 
volumes V, and E, 


2 
V.=— 


nr 
n.2"-Ip (5) 


may be calculated by the Monte Carlo method, 

In concluding, we observe that satisfactory results, obtained 
by using only once some goodness-of-fit criterion for testing 
pseudo-random numbers, do not guarantee the quality of these 
numbers, Only a repeated investigation of pseudo-random numbers 
by means of some goodness-of-fit criterion or other, based on 
comparing the theoretical distribution with the empirical distribu- 


V, 
{=z = 


tion of the statistic on which the criterion is based (i.e., 2,n 2D, 
or ”), can justify us in assuming that a given method of obtaining 
. Sequences of pseudo-random numbers is fully satisfactory. 


4, PHYSICAL GENERATORS OF UNIFORM 
RANDOM NUMBERS 


Random numbers uniformly distributed over the interval (0, 1) 
can be obtained by means of arandom physical process. To do this, 
special ‘‘attachments’’ to electronic computers, the so-called 
random-number generators, have beendeveloped and constructed, 
The use of random-number generators on an electronic computer 
enables us to obtain a combination of random binary digits in the 
array of digits of the random number represented in the electronic 
computer, 

There are two main methods of obtaining random numbers by 
means of a physical process. The first method is based on the 
radiation of radioactive substances, the second on the noise of 
electron tubes. According to the physical process used, random- 
number generators are subdivided into radioactive and radio- 
noise type. 

Let us proceed to consider radio-noise type random-number 
generators. Firstly we shalldescribe the noise sources most often 
used in such generators, after which we will pass to describe 
circuits of the random-number generators themselves. 

The basis of the operation of any radio-noise type random- 
number generators is the noise source—a noise generator. Ina 
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whole series of electronic circuits there is inherent fluctuating 
noise, which, with suitable amplification, can ensure sufficient 
fluctuation of the output voltage. Such internal sources of fluctua- 
tions comprise: 

a) thermal noise, caused by the motion of electrons in the 
space-charge volume of a tube; 

b) shot noise, caused by departure from uniformity (in time) 
of the number of electrons leaving the cathode surface; and a whole 
series of other sources. 

External sources of fluctuations comprise variable electronic 
and magnetic fields of electric generators, cosmic radio noise, 
gas-discharge devices, etc. All these sources of fluctuations can 
be used for designing noise generators. 

Several typical noise~generator circuits which give good results 
in random-number generators have been developed. The noise- 
eiichaes circuits shown below have been suggested by V, P, 

ae ls at a Computing Center of the USSR Academy of Sci- 
ae 26, 28]. 


Fig. 30, Noise generator with germanium transistor, 


Figure 30 shows a noise-generator circuit using an SID ger- 
manium transistor as the noise element, The noise generator pro- 
duces a series of random numbers of pulses which are amplified 
and shaped into standard pulses. Subsequent transformations in 
the random-number generator develop from these pulses the 
binary code representation of a random number, 

Figure 31 shows a noise-generator circuit the noise element 
of which uses a gas-discharge tube and a magnet. Noise pulses 
of the order of several tens of millivolts can be obtained directly 
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from the gas-discharge tube for a suitable orientation of the mag- 
net. The output signals from the tube are applied to the input of 
an amplifier, and working signals with amplitude of 25 to 40 volts 
are obtained at the output of the circuit. 


Fig. 31. Noise generator using gas-discharge tube with 
magnet, 


Among the noise-generator circuits considered, the circuits 
using a gas-discharge tube with a magnet give the best results for 
the operation of random-number generators. 

Of the noise generators produced commercially we must recom~ 
mend the GShN noise generator, which covers a frequency band 
from 50 cps to 6 Mcwithan effective value of the output signals of 
0.75 volt and an output resistance of 75 ohms + 1%. 


Radio-Noise Type Random-Number Generator Circuits 


There are a variety of configurations ofthe circuits of random- 
number generators which can be classified accordingly into three 
groups: 

1. Random-number generator circuits which make use of the 
random state of the system (ofthe circuit) after perturbation (after 
switching on, switching off, start, stop). 

2, Random-number generator circuits in which time intervals 
of random duration are registered, During these various (random) 
intervals of generation, standard pulses of a fixed frequency are 
sent to a counter or a trigger. 

3. Random-number generator circuits in which the number of 
random pulses during a fixed interval of time Af is registered. 

We give a few examples of random-number generators cor- 
responding to the various groups enumerated above. 
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1. One of the typical random-number generators working ac- 
cording to the principle of the random state of a system is that 
developed by V, V, Chavchanidze [64], This random-number gen- 
erator is electromechanical and is a small electric motor having 
a speed of rotation of about 3000 revolutions per minute, 

In order to remove resonance oscillations, the motor is 
mounted on a massive base. A circular diskof 10 cm diameter is 
fixed on the shaft of the motor. The circumference of the disk is 
divided in 100 equal parts numbered from Oto 99. A fixed pointer 
is mounted against the disk. The motor is connected for 2-3 sec- 
onds, is then disconnected and is brought to a stop by means of a 
brake, As the disk reaches its position of rest the pointer indicates 
a certain division on it. The corresponding random number (divi- 
sion) is recorded in a table and the motor is again connected for 
obtaining the following random number. 

In view of the fact that from considerations of symmetry the 
pointer can be found with equal probability against any one of the 
divisions (from 0 to 99) sucha random-number generator produces 
uniformly distributed random numbers assuming discrete values 
from 0 to 99. The generation of a single random number in the 
electromechanical random-number generator takes a few seconds. 
In comparison with the operation of electronic computers this 
process of generating random numbers is very slow, The low speed 
and the manual operations involved (start, stop,.recording of disk 
divisions) make sucha type of random-number generator unaccept- 
able for electronic computers. However, it can be usefully em- 
ployed when there is a need for a comparatively large quantity of 
random numbers without the use of electronic computers, 

2, Another interesting random~number generator scheme 
based on the principle of the random state of a system is that de- 
veloped by Z. Pawlak (Poland) [228]. Pawlak’s random-number 
generator uses the random state of an electronic circuit, rather 
than that of an electromechanical one as in the previous example, 
A circuit diagram taken from [228] of the Pawlak flip-flop trigger 
circuit is shown in Fig. 32. Thekey K serves to connect the anode 
voltage. 

Each switching of the key X sets the trigger (the random sys- 
tem) in one of two possible states: A or B. We can obtain, by 
means of this circuit, 2% random elements X,, where X;=A if 
the jth switching of the key K sets the trigger in the position A, 
and X,=8 if the jth switching of thekey K sets the trigger in the 
position B, 

Thus a finite set of statistically independent results A and B 
can be obtained. We give, below, one of the sets of positions of 
the trigger after switching: AABAABBABBAABBBABBAB. 
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Fig. 32. Circuit diagram oi 
trigger. 


Let y, be a set of & pairs of elements X,, such that 1,= «,_1, 
X),, Where 1<i<k. By omitting in the sequence 7, all elements 
AA and BB, we obtain a third set the elements of which are only 
pairs of the type AB and BA, Let us denote these pairs 0 and 1 
respectively. 

Let P;(A) and P;(B) be the probabilities, for the jth switching 
of the contact K, that the trigger is set in the positions A and B 
respectively. 

By assuming the trigger to be asymmetrical 


P, (A) > P;(B), 


and the properties of the trigger to be constant between two switch- 
ings, we can write 


Py; (A) = Po; (A), (6.1) 


Py_1(B) = Po (B). (6.2) 
From (6,1) and (6.2) we have 


Py -1 (A) Po; (B) = Po; (A) Po;-1 (8). (6.3) 
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Since 
Py,_1 (A) Po; (B) = P; (0) (6.4) 
and 
Po, (B) Py; (A) = P; (1), (6.5) 
then 
P,(0) =P; (1), (6.6) 


where P;(0) and P;(1) are the probabilities of obtaining the values 
0 and 1 respectively at the ith place of the sequence 7,. As has 
been indicated above, by zero (0) and one (1) are denoted pairs of 
successive states ofthetrigger ofthe type AB and BA respectively. 
Such a trigger circuit enables us to obtaina random state of a sin- 
gle binary digit. By means of Nsuch switching triggers a genera- 
tor of N-binary-digit random numbers can be assembled. 

3. A random-number generator prototype using Pawlak’s cir- 
cuit has been developed at the Computing Center of the USSR 
Academy of Sciences for the BESM-II computer, This generator 
belongs to the second group (based on the principle of a random 
interval of generation). 

Figure 33 shows the block diagram of this random-number 
generator. Noise signals are generated by a noise generator NG 
consisting of a TGIP gas-discharge tube anda pulse-shaper output 
tube. Pulses from the noise generator are applied, in a correction 
stage, toa trigger 7,. The noise pulses flip the triggers 7, and 7, 
in random sequence, and in the intervals between them pulses of 
the fixed frequency f= 200 kc can pass. The trigger 7, can have 
two possible states, A or 8B. When the random-number generator 
operates, a random set of positions ofthe trigger 7, of the follow- 
ing possible form is obtained: 


ABBABBBAABAAA. 


From the random set of positions of the trigger the special 
correction circuit removes elements of the form AA and £8 and, 
by means of the second trigger 7,, generates a new sequence the 
elements of which are only pairs ofthe type AB and BA. We denote 
these pairs 0 and 1 respectively. 

It can be shown by similar arguments to the above that 


P,(0) =P, (1). 
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32-Digit shift 
register 
(adder in BESM-II) 


f=200kc 


Fig. 33. Simplified block diagram of random-number 
generator for the BESM-I, 


In other words, the probability that the ith pulse sets the trigger 
T, in the ‘‘zero’’ state is equal to the probability that it sets the 
trigger in the ‘‘one’’ state, 

The above correction circuit thus enables us to obtain equi- 
probable random binary digits. The correction stage generates 
random code pulses (which arrive at the first digit of the shift 
register), as well as shift pulses which, via the common bus, 
reach all 32 digits of the adder. 

4, Random-number generators based on recording a random 
number of pulses in a fixed interval of time are used today in 
several scientific organizations of the Soviet Union. We give a 
general description of the random-number generator designed for 
the ‘‘Strela’’? computer at the Computing Center ofthe USSR Acad- 
emy of Sciences, A detailed description ofthe circuit and mode of 
operation of the generator is given in [26]. 

The noise~signal source is a noise generator comprising a 
TG1P gas-discharge tube witha magnet (see the circuit in Fig. 31). 
The block diagram of the random-number generator is shown in 
Fig. 34. A front view of the random-number generator for the 
‘Strela’? computer is shown in Fig. 35. The random~number 
generator is intended for obtaining simultaneously all the digits 
of twelve-digit binary numbers, All twelve digits work independ- 
ently of the others from individual noise generators. For each 
digit of the generator there are anoisegenerator, a gate, a pulse 
shaper, a trigger and an output inverter. 

At each step of operation of the ‘‘Strela’’ computer the random~ 
number generator produces a random number which reaches a 
fixed location of the accumulator of constants (the location has 
the ordinal number 7757), The transfer ofthe random number from 


the location 7757 to the working locationa is carried out by means 
of the instruction [7757| - [a] 0[13}. 


4, PHYSICAL.GENERATORS OF UNIFORM RANDOM NUMBERS 227 
ealeaR ea eahia cage caheay cance 
CPE EPC PEP EPC EPR CEP eal 
eed 9 Edd od Bd ol Bs] ed Bod 

pt} fee} [es] fea] bes} fre] f-7] [es] fre] fe fev] [re 


Fig. 34, Block-diagram of random-number generator 
for the ‘Strela’? computer. 


Fig. 35, Front view of random-number generator for the 
‘Strela’? computer, 


Let us proceed to describe the operation of the random-number 
generator, A random number of pulses from the noise generator 
is routed via the gate and the pulse shaper to the count input of 
the trigger. The circuit diagram of the gate is shown in Fig. 36. 
It has two control inputs, Tothe first input there are applied pulses 
from the noise generator, andto the second input there are applied 
positive gating pulses from a one-shot multivibrator. In order 
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to ensure that random numbers be issued at each step of opera- 
tion of the computer, the one-shot multivibrator is triggered 
at the beginning of a step by a special pulse applied from the 
control unit of the ‘‘Strela’? computer. Pulses occur at the gate 
output for the coincidence of a wide gating signal from the one- 
shot multivibrator with pulses from the noise generator. 


Gating pulses 
can 


Fig. 36. Circuit diagram of gate of random~number 
generator, 


Pulses from the gate are applied, via the pulse shaper, to the 
count input of the trigger, the circuit diagram of which is shown 
in Fig. 37. The trigger records the random number of pulses ap- 
plied at its input (even or odd) byassuming, when a train of pulses 
has ended, one of two random states: zero (0) or one (1). The 
random states of all twelve triggers (0 or 1) are transmitted, via 
the inverters, to the corresponding digits of the 7757 location of 
the accumulator of constants. The circuit diagram ofthe inverter 
is shown in Fig. 38. The twelve digits of the random-number 
generator are used as the last twelve digits of the mantissa of the 
number, 

As regards construction, all stages of the random-number 
generator are standard removable units. 

F, Sterzer (England) has constructed a random-number genera- 
tor using subharmonic oscillators [245]. The random-number 
generator is a continuous-action electronic device capable of 
generating random binary symbols ata rate of the order of 3 x10’ 
symbols per second. The generating rate can be increased by con- 
necting several such devices in parallel. 


| 
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Fig. 37. Circuit diagram of trigger of random-number 
generator, 


Fig. 38. Circuit diagram of inverter 
of random-number generator. 


The principle of operation of the equipment consists in the 
following. Two subharmonic oscillators are driven by a common 
source of oscillations of a frequency of about 4000 Mc. The fre- 
quency of the output signals of the oscillators is halved to about 
2000 Mc. Owing to the presence of noise the phases of the output 
Signals are quantities random in time. The random device 
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compares the phases of the output signals of the generators and 
generates a sequence of binary signals. Ones occur in the case of 
coincidence of phases, and zeros if the signals are found in anti- 
phase, The equipment comprises a klystron oscillator, wave- 
guides and crystal diodes. 

We should mention also the random-number generator de~ 
veloped by I, A, Danil’chenko. The mathematical bases of this 
attachment for the ‘“‘Strela’’ electronic computer are described in 
[30]. The key idea consists in balancing the probabilities of oc- 
currence of binary figures in the digit positions of the random- 
number generator by superimposing direct and inverse repre- 
sentations. The random-number generator obtains random binary 
digits with close to 0.5 probability of occurrence of a zero, and is 
characterized by stable operation. 

Let us proceed to describe radioactive random-number genera- 
tors, These usually consist of a radiation source of radioactive 
particles and a counter, The latter counts radioactive particles 
registered during a certain interval of time Af, If this number is 
even, the random digit of the generator connected with the corre- 
sponding counter is taken equal to zero. If, however, the number 
of particles is odd, the value of the random digit is taken equal to 
one, We can easily calculate that if the probability of radiation of 
k radioactive particles by the radioactive substance during the 
time At is equal to 


_, Bante *™ 

v»,=— Sar 

then the probability of occurrence of an even number of particles 
during the time Af is equal to 


Len? 4t 
p=—z— 


If ,At is sufficiently large, the value of p is close to 0.5, 

A good radiation source of radioactive particles is radioactive 
cobalt, Figure 39 shows the circuit diagram of the counter of a 
radioactive random-number generator operating with @ particles 
radiated by radioactive cobalt, An STS-2 type {$-particle counter 
is connected to the input of a three-stage amplifier. This counter 
is shown in Fig. 40. The use of transmitters of particles safe for 
the human operator has given comparatively lowfrequencies. For 
example the luminous dial-plate of watches has given random 
pulses witha repetition frequency of afew cycles per second. More 
intense sources of §-particles enable us to obtain good results in 
the operation of a random-number generator. 
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Fig. 39. Circuit diagram of counter of radioactive type 
random-number generator. 


Fig. 40. Counter of 
radioactive type ran- 
dom-number genera- 
tor with slabs of 
radioactive cobalt. 


A generator using a counter of radioactive radiation has been 
installed on the G-2 electronic computer at Gottingen at the Max 
Planck Physical Institute. The block diagram of this generator is 
shown in Fig. 41. It uses a single-digit counter of radioactive 
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radiation of y-quanta. Random states ‘‘0’? and ‘“‘1’’ are applied 
from the trigger to the input of the gate. The trigger is flipped 
after each registration of a y-quantum. The generator gives 800 
random symbols per second for a number of recorded particles 
up to 4000 y-quanta per second, 


| 2 | Output of 
random symbols 
Gating pulses 
Fig. 41. Random-number generator of 
the G-2 computer at Gottingen: 1—radio- 


active preparation, 2—counter tube, 3— 
trigger, 4—gate. 


The purpose of a random-number generator is to obtain random 
numbers uniformly distributed over the interval (0, 1). There- 
fore, a test of the quality of the numbers obtained is at the same 
time also a check of the operation of the generator itself, The 
need of periodically assessing the random numbers generated by 
means of a random-number generator makes it necessary to es- 
tablish criteria of its correct operation and to develop for it 
suitable testing procedures, 

One of the most important characteristics of a random-number 
generator is the estimate of the probability of occurrence of a zero 
or one in each digit of a random number or, in other words, the 
estimate of the maximum departure of the probability of occurrence 
of a zero or one from its prescribed value 0.5. A value of 0.5 for 
this probability will occur only in the case of an ideal random- 
number generator giving an exact rigorously uniform distribution. 
In practice, however, such a situation never occurs. Lack of 
constancy of parameters of the circuit, reduced emission of tubes 
and other reasons bring about the mismatching of the random- 
number generator, and hence departure from uniformity of the 
distribution of the random numbers generated by it. We assume 
that the probability of occurrence of a zero in each digit is not 
equal to 0.5 but to 0.5(1+%), and the probability of occurrence of 
a one is accordingly 0.5(1 —3). 

There are several effective methods of decreasing the value 
of 3. Thus good results are obtained, for example, by replacing 
the sequence of random binary numbers of the random-number 
generator {t,| by another random sequence({7,}, where 7,==,-:++£, 
(mod 2). Thus the value of 7, is one if the value of §,_, and § are 
different, and is zero in the opposite case. Thus 
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P (m= 0} =P {§,_1=0} P {&=0} +-P {&_,;=1)P (= 1) = 
ta 8p +70 — = 0.5 (1+ 8. 


Similarly P {n,='1} is equal to 0.5 (1—%8). Thusthe departure ob- 
tained is of a higher order of smallness (8 instead of 4) (see [30]). 

Such a method can be realized in practice by adding random 
digits modulo 2 inthe counter stage ofa random-number generator. 

Let us analyze in greater detail some characteristics of the 
operation of random-number generators. Suppose a sequence of 
independent random binary numbers (§,} obtained by means of a 
random-number generator has a nonuniform distribution of zeros 
and ones. Let the probability of occurrence of a zero for an ele- 
ment ¢, be equal to 0.5 (1-+-8)=p and the probability of a one be 
accordingly 0.5 (1— 8). The length n of this sequence is assumed 
to be sufficiently large. 

One of the most important statistical characteristics of a se~ 
quence {é,} is the estimate of the number of zeros (or ones) for 
sufficiently large n. 

In particular, it is ofinterest to estimate the relative departure 
of P(n, m, p) from P(n, m, 0.5) for large n where 


P(n, m. p)=(F)P"(— py ™ 


isthe probability of occurrence of m zeros (or ones) in an n~symbol 
sequence of binary numbers, 
By using the de Moivre-Laplace theorem, it canbe shown that 


‘ , _ P(n, m, p)—P (a, m, 0.5) 
A= P(n, m, 0.5) ~bYV nc, 


where C is a constant which for sufficiently large n can be made 
arbitrarily small. The relative departure A is directly proportional 


to 8 and n! 5A 

It is of interest to fit this result with the corresponding esti- 
mate in the case of small values of x. We point out that all upper 
estimates given below refer to the absolute value of the departure 
A, 

We assume, as before, that the probability of occurrence of a 
zero in each digit of the random-number generator is equal to 
p=0.5(1+8), and that the probability of a one is accordingly 
qg=0.5(1—8). Suppose the random-number generator produces 
random n-digit numbers. We shall estimate the relative departure 
of the probability P(n, m, p), that a random number contains m 
zeros, from the nominal probability P(x, m, 0.5). 
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By using Bernoulli’s theorem we obtain 
[A] << +|8|)"—1. 


If it is required to find a 8 such that |A|< A) (1 >A4,>0 is an 
assigned number), then the corresponding || must be chosen equal 


to (1+A,)* —1. 


In fact by putting (1 +-[8, |)” — 1equalto 4,we obtain the required 
result. 

Thus |A| < A, if [8] <]&]. 

We observe that for small values of 8 there exists for |A|a 
simple linear upper estimate. 


Let 0<8<-—y. Then 


A<8(n-+-0.5). 
Thus for small values of z an upper estimate for the relative 
departure A is proportional to n, and not to nl 2, as in the case of 


large values of n. In fact the departure A is of the same order of 
smallness as 8” only for values of m that are found at the ends of 


the interval (0, 2). For mm the value of A is close to zero. 


For large values of n the partialdeparturesA, accumulating on 
each other, give as a result an overalltotal departure. This total 
departure lies between the largest and the smallest partial depar- 


tures and therefore is not proportional to a but only to ni, Let us 
estimate the relative departure of the distribution function of the 
number of zeros (or ones) for the case p= 0.5(1-++ 8) from that with 
p=0.5 for small values of x. We have 


P (pcm) =h_,(2—m; m+1). 


l-p 
: 1 mo 
“Eom aL wen f g8—m-1(] — §)™ ds. 


The relative departure A will have in this case the following form 


My 
f s?-M—1 (1 — s)™ds 
1 
7 ; 
Vg ’ 
fem —aymas 
0 


A= 
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it can be shown that 
A<8(n—m) 
or, since n—m<n, 
A< $n. 


The above estimate for A shows thatthe departure A decreases 
for an increase of m and is equal to zero for m=n, 

In order that the value of the quantity 6 be sufficiently small in 
each digit of the random-number generator, a series of design 
requirements must be imposed on the individual component parts 
of the random-number generator. Of course, such requirements 
must be theoretically justified. 

Let us consider the simplest version of random-number genera- 
tor design (a noise generator, a gate, a trigger and an inverter) 
and let us find the dependence of the quantity 8 on various param~ 
eters of the trigger. It is well knownthat asymmetry of the latter 
plays an important role in the nonuniformity of the corresponding 
random digit. 

The noise generator in a random=-number generator consists 
of a noisy stage and an amplifier, From the amplifier the pulses 
are applied, via a gate and a pulse shaper, to the trigger of the 
random-number generator, 

As is well known, a trigger can assume two opposite states: 
a state A (for which the counter stage assumes the value zero) and 
a state B (for which the corresponding value is one). 

In order to flip the trigger from one state to the other, a pulse 
of a certain amplitude must be applied to its count input. Sup- 
pose the minimum pulse amplitude capable of flipping the trigger 
from state A to state B is equal to a,. At the same time a pulse 
that flips the trigger from one state to another may be insufficient 
to reverse the state. 

Without loss of generality we can assume that the minimum 
pulse amplitude capable of accomplishing the reverse flipping of 
the trigger from state B to position A is equal to a, and that 
a, >a,. Thus a pulse having amplitude larger than a, accomplishes 
the flipping of the trigger in either direction. 

Pulses of even larger amplitude are characterized by the 
property of accomplishing a double flipping of the trigger: from 
one state to another and back, Let the minimum amplitude of 
such a pulse be equal to a, >a). 

Let p, be the probability for a pulseto have amplitude less than 
ai 
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Pla <a} =P. 


Let p, be the probability for the pulse amplitude to lie between a, 
and Qo} 


P {a,<@ < ay} = pp. 


Let p, be the probability for the pulse amplitude to lie between a, 
and a; 


P {a, <u < a3} = py. 


Finally let p, be the probability that the pulse amplitude is not 
smaller than a,: 


P (u>a,} = py 


We denote by P, the probability that after the arrival of the 
nth pulse the trigger assumes the state A, and by Q,the proba- 
bility of the opposite event, 

We denote by P, the probability that, before the random-number 
generator starts to operate, the trigger is found in the state A, 
and by Q, the probability of the opposite event. 

Of course, we have to estimate the probability of occurrence of 
zero (or one) in a digit of the random~number generator for the 
stationary (steady-state) process of obtaining random binary digits 
from the corresponding trigger. To do this we must study the be- 
havior of P, in the transient process that leads to stationary 
operating conditions of the random-number generator. 

By applying the elementary theory of Markov processes, we 
can prove the following: 

The probability of nonoperation of the trigger after a pulse 
has arrived from the noise generator is p,, and p,, p, and p, are 
the probabilities of unilateral, bilateral and double flipping of the 
trigger, Then, in the process of establishing stationary operating 
conditions, the probability of occurrence of a zero at the corre- 
sponding digit is expressed by the formula 


—__ Ps =e Ps ae a 
Pr ~~ po+ 2ps +(P, P2+ 2p ) Pe 2Ps) , 


where, as stated above, P, is the probability that the trigger as- 
sumes the zero state after the arrival of the ath pulse and P, 
is the probability of the same event before the generator starts to 
operate. 
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For sufficiently large n, P, can be taken equal to 


ae .: ae 
P= +o, 
Hence 
gr oa ee 
arat slieea ee 
—_ P2 
@ ~~ Po + 2p; ° 
If ppo< p;, then 
1 pe 
Q—P~>. 


If we consider all pulses to be operative, then, by putting p,=0, 
we obtain 
Po 23 Po 
P2+2p3 1+ Ps3—Ds 


If p,»<< p; and p,<_p3, then p,~ 1 and therefore 


1 
Q—P~>Z Pr 
If we put the difference Q— P equal to 0.001, then p,~ 0.002, 
Since Q—P=84, then 
1 
ins Py 


If the voltage of the noise generator is equal to Vnc volts, then, 
applying the formula p,= 28, the spread of the operating voltages 
(reduced to the amplifier input) must be not larger than 


Vs ss 23Vnc . 


If the gain of the amplifier is equal to &, then the spread of the 
operating levels of the trigger must be not larger than 


AV=kV 5 = 2k8VNG- 


If k=40, A~=0.01, Vnc =2.5 v and n=12, then AV=0.2v, 

If the mean operating level ofthe trigger is equal, for example, 
to 15 v, then we must provide inthe trigger such a stability of the 
resistances and voltages that the relative spread of the operating 


levels shall not exceed aes + 100% 1.38%. 
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Such requirements, arrived attheoretically, set very stringent 
limitations on the parameters of random-number generators. 
Moreover they are not met inthe great majority of random-number 
generators satisfactorily operating today. However, they can often 
be dispensed with, since the quality of randomnumbers generated 
by means of random-number generators is assessed basically by 
means of the system of accepted statistical criteria which must 
be satisfied by the aggregate of random numbers.* 

Several random-number generators which provide stable and 
sufficiently high-grade service are now in use, They satisfy a 
system of statistical tests despite the fact thatin individual cases 
the departures of the probabilities of occurrence of individual com- 
binations of digits exceed critical values. 

In order to illustrate the potentialities of random~numher gen- 
erators, we shall cite, for example, data of a random-number 
generator operating on a “Strela’? computer at the Computing 
Center of the USSR Academy of Sciences 6]. 

Experience gained in solving a whole series of problems by 
means of this generator as well as the results of repeated tests 
enable us to draw the following conclusions; 

1) The random-number generator gives a relative error inthe 
results which does not exceed 10-12%, but this error is less than 
10% for the great majority of problems and lies within the range 
4~10%. 

2) The probability of occurrence of a zeroora one at an arbi- 
trary binary digit of a number generated by means of the random- 
number generator varies within the interval 0.44 < p,< 0.56. This 
estimate has been obtained as a resultofa series of tests extend- 
ing over a long period of time. 

3) For the majority of problems solved by means of random~ 
number generators the result differs from the result obtained 
by means of pseudo-random numbers by less than the evaluated 
statistical error within a single «, 


5. TESTS OF THE OPERATION OF 
RANDOM-NUMBER GENERATORS 


The quality of random numbers obtained by means of random- 
number generators must be periodically tested. This is necessary 


*In addition, the probabilities in the individual digits can be evened out by 
several special methods described in (Chapter I, Section 3) and also in [26, 28, 30]. 


| 
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for preventive maintenance of the installation, since the process 
of random-number generation can go out of control: the generator 
can ‘‘run wild.’’ 

Therefore, on all electronic computers containing random- 
number generators periodic test routines are run several times 
a day in order to check the performance of the random-number 
generator. As a rule, test programs include [26]: 

a) A program to test ‘‘randomness’’ by using the method of 
‘‘runs’’? (see Section 3). Tests according to this criterion usually 
give perfectly good results, 

b) Tests for checking departure of the empirical distribution 
from uniform. The goodness-of-fit criteria usually employed are 
the #? criterion and Kolmogorov’s criterion, although other, 
related methods are often used, Thus at the Computing Center 
of the USSR Academy of Sciences distribution tests are carried 
out according to the x? criterion on k groups of 10,000 numbers 
each (k= 5—7). 

If for any group tested the value of pi exceeds 30, thenit is as- 
sumed that the distribution does not coincide with the uniform 
distribution regardless of the results of tests of the remaining 
groups. In this case the degree of confidence of our hypothesis of 
uniformity of the distribution is less than 0.05. In addition to this, 
another more severe distribution-testing criterion has been es- 
tablished. For any/ (1 <j < k) the mean number of random numbers 
occurring in the ith interval (1 <i< 20) must not lie beyond the 
limits , 


500—2y/ Bc} “e <500+2y/ ™ 
p=l 


ij 


(for any / and j), where », (i) isthe number of objects occurring in 
the ith interval at the pth test, 

The latter criterion is based on a trivial probability relation: 
with probability 0.95 we have 


i 4 
y © _ 500 


<2— 
p=1 / : 


Vi’ 


where o? is the overall variance of the number of elements in the 
ith interval. However 


o? = npg == 10,000 «35 "op © 500 (a ~ Y 500). 
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After & tests, the distribution is reckoned to be uniform if and 
only if both criteria are satisfied at the same time, 

A suggested criterion for testing the distribution in the digits 
of a random-number generator is the estimate of the distribution 
of the empirical entropy with respect to all digits. 

Let & be the number of binary digits of the random-number 
generator and let m, be the number of zeros obtained in n tests in 
the ith digit, Let us denote by H, the empirical entropy in the ith 
digit 


A= — Tint — (1 — 72) in (1 — 2), 


It can be shown that ifthe probability of occurrence of a zero in all 
digits is equal to 0.5, then the random quantity 


k 
c= 2n( hind — > H,) 
f=) 


is asymptotically distributed, for noo, as Ge 

Tests of the distribution of the entropy © must be carried out 
for a sufficiently large 2 (for example “= 10,000), The empirical 
distribution of the values of { obtained is compared with the Gi 
distribution. If the comparison gives satisfactory results, the 
distribution of the digits of the random~number generator is reck- 
oned to be sufficiently close to uniform, 

c) The execution of a test associated with evaluation of the 
parameters of a control problem is obligatory. We can use as 
such a control problem the evaluation of the volume of an n- 
dimensional hypersphere. 

When the results of all tests are satisfactory the random- 
number generator may be assumed to be working normally and to 
be ready for operation, If some test gives unsatisfactory results, 
operation of the generator must be investigated more closely, 
Several other statistical characteristics have to be calculated. 
These are mainly data concerning the spectral structure of the 
random process of the generation of numbers by the random- 
number generator. 

The production by a random-number generator of random num- 
bers uniformly distributed in the interval (0, 1) can be considered 
as a stationary random process « (f)with discrete time. 

At any instant of time the random quantity (f) has one and the 
same uniform distribution. Therefore the expectation Mx (t) and 
the variance Dx(#) are constant for any instant of time and are 


equal to?/, and */1. respectively. 
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It is well known that a probability process (f)is called sta- 
tionary in the broad sense if 


MIx (4)?] < co 
for all values of ¢ and if 
Bt) =M (Ix ¢+7)— Me @l[xO—Mex Ol} 


does not depend on /. The function &(c) is called the covariance 
function of the process. 

In the case of the random process of obtaining numbers from a 
random-number generator all these conditions apply and, more- 
over, the covariance function 8(t) need not depend on+ but has a 
constant value equal to zero, 

The determination and testing of statistical estimates of the 
random process are conveniently carried out by averaging data 
obtained asa result ofa concrete realization of the random process 
over a sufficiently long interval of time. = 

It is well known that the variance of the mean value x (f)is equal to 


pe), where “ is the number of realizations on which averaging is 
carried out, From this, confidence intervals for the sampling 
average x (f)can easily be constructed, 


R # 
We have, with probability O()=Te fe? at, for n = 1000: 
an 
i] 


1 R a Rk = 
Plays <<? 9< 24+} 


for p= 0.997 we have k=3, 
Therefore, with probability 0.997 we have 
0.47 < x(t) < 0.53. 
For k= 2, p=0.95 and with probability 0.95 we have 
0.48 < x (t) < 0.52. 


A statistical estimate of the covariance function B(t) is given 
by the formula 


B@=LY eG) —D«G)—2)h 


i=l 
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where x (¢,) is the value of the realization of the random number at 
the time of the ith step of operation of the electronic computer, 
and x is the mean value of the random numbers (~ 0.5). 

A very useful test to check the quality of a random-number 
generator is to compute the mutual covariance function B,, (*) 
for the stationary processes of generating zeros and ones in the 
various digits of the random number. 

If the contents of the digits in random numbers generated by 
means of random-number generators depend on each other, the 
function B,,(t} will either vary with time or else will assume 
values differing from those calculated theoretically. 

The quantity 8,,(t) has the following estimate 


By => Yixt)—* OMY G.I—IOL 


j=l 


where x(t) and y(f;,.) are the contents of the two tested digits of 
the random number generated at the jth step of operation of the 
electronic computer, x(f) and y(t) have the value 0.5, and a is the 
size of the sampled sequence (n =~ 1000). 

A confirmation of absence of correlation is provided by a sta- 
tistical estimate of the correlation coefficient between digits, The 
calculation of the empirical value of the correlation coefficient r 
is carried out from sampling data of size according to the 
formula 


To establish whether correlation coefficients r obtained from 
observations are significant or whether their departure from zero 
may be explained as the result of random sampling, we must test 
the hypothesis »=-0 (p is the theoretical correlation coefficient), 
The empirical correction coefficient risusedas the test statistic. 
By choosing a confidence coefficient p, we construct a region of 
the form 


jr|> top 


and if the value of the empirical corrélation coefficient r obtained 
lies beyond these limits, then the hypothesis p=0 is rejected. 
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Here ¢, is determined from the relation 


and 


In other words, if 


r 


— 
ll> aaa 


we reckon that correlation exists between the digits. The value 

p= 0.99 is taken for the confidence coefficient p, Hence ¢, = 2.58. 
This type of test must forman integral part of the overall pro- 

gram for testing the operation of random-number generators, 


Chapter VII 
Transformation of Random Numbers 
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The construction of algorithms for the solutions of problems 
by the Monte Carlo method is intimately connected with the de- 
sign of random experiments, In this connection there often arises 
the need for generating random quantities or processes of 
various types. These comprise in the first instance independent 
and dependent random events, Markov chains, random quantities 
with assigned distribution laws, random vectors, stationary and 
nonstationary random processes with assigned probability char- 
acteristics, etc, 

It has been shown in Chapter VI how to generate sequences 
of uniformly distributed random quantities either by means of 
special subprograms (in which case pseudo-random numbers 
are obtained) or by means of random-number generators. 

In this chapter we describe algorithms for transforming uni- 
formly distributed quantities, enabling us to obtain random quan- 
tities or random vectors with practically any distribution law. 
Thus the generation of random quantities on electronic computers 
reduces to obtaining independent quantities with uniform distri- 
bution laws and to subsequently transforming them by means of 
special programs. We could also use special generators for 
quantities with various distribution laws (for example with the 
Gaussian or Poisson’s distribution law), but this approach is not 
followed in practice in general-purpose electronic computers, 

Thus, we shall assume that we have available a set{R,} i=1, 
2,3, ...) of random numbers &, having uniform distribution over 
the interval (0,1). This is called the initial set, On the numbers 
R, we need to carry out such operations that we may obtain, asa 
result, a set {S,} of values of a random quantity with an assigned 
distribution law, In view of the fact that the number of digits of 
the numbers processed by electronic computers is finite, we shall 
not be able to operate, strictly speaking, with random numbers 
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R, having uniform distribution over the interval (0,1); what we 
shall actually have is a set {Rj} of random numbers 2} having a 
so-called quasi-uniform distribution. 

As is well known, a continuous random quantity } uniformly 
distributed over the interval (0,1) has the density function 


1 for 0¢x <1, 
Aint a oldest (7.1) 


The expectation of the random quantity is equal to 
: 1 
MiI=>, (7.2) 
the variance is equal to 
1 
D> (7.3) 


and the standard deviation is equal to 


A 
~ 2¥3° 
The random numbers 2} are realizations of a random quantity 
C differing from § Let us study the distribution of ¢. The ran- 
dom numbers Rj are so generated (see Chapter VJ) that at each 
digit of a #-digit number there occurs a zero or a one, each with 
probability equal to 0.5. Therefore possible values of the dis- 
crete random quantity 6 will be numbers R} 


(7.4) 


0,1. 2 £. 2k—1 
” oR’ me seen gr’ eae oR 


the probability p, of each of which is equal to 2~* 
The expectation of the ad ¢ is equal to 
oka 


2k (2k—1) 
Mo = rae aa 3 (— — — gaat ~—~C~S (I —3)- (7.5) 


The variance is easily evaluated by proceeding from the expres- 
sion 


t= > ea! (7.6) 
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where «, are independent Bernoullian quantities assuming with 
equal probability the values zero and unity. 
Thus the variance of the quantity ¢ is equal to 
4 1X 1 a 1 1 
Ss a -a_ _ 
pt= >) 2°"De, = 4 »» agen!) 0 


ke 
f=1 f=] ‘ 


4 


The departure of the expectation (7.5) from the value 1/2 can 
prove significant if the number & of digits is small. Therefore it 
is often convenient to consider a quantity 4 of the form 


j= rr gh (7.8) 


This quantity assumes the values 


1 2 


0, yas ee a ee E, 
2*—1° 2— |] 


The expectation of the random quantity q is equal to 


ot _4 of 1 


Mid= a ot 2 bie: 


i=0 i= 


However 


oF} 


> . 2F(2#_1) 
eae Tin. 


i=0 


and therefore 
‘ (7.9) 


bo} = 


M [n] = 
The variance is equal to 


of _y 


Dii= ¥ (4-3) o-*. 


é=0 
By taking into account that 


ke 


2"-1 


= (2* —1) .2* (2k+1 —1) 
oan mabenaacme * 


i=0 
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we shall obtain 


1 2F4+1 


Dal=pa (7.10) 


The standard deviation will be equal to 


<i Qk 4.1 7.11 
1 OVS Vy x4. co 


When k-co we have o,-> WT’ The order of approximation can 
be seen from Table 14. 


Table 14 
k Dr] s, & Df{r) Sy k Df] Sy k Dr) Sy 


2 0.1389 0.3727 5 0.0887 0.2979 8 0.0840 0,2898 12 0.0834 0,2888 
3 0.1071 0,3274 6 0.0860 0.2933 9 0.0837 0,2893 15 0.08334 0,28870 
4 0.0945 0,3073 7 0.0846 0.2910 10 0,0835 0.2889 20 0.08333 0.28868 


We observe that D [=] = 1 © 0.08333 and o, = wie 0.283868. 

If the number of digits of the numbers processed in the com- 
puter is large, the difference between {Rj} and uniformly dis- 
tributed random numbers may be ignored in the solution of the 
majority of computing problems. Problems exist, however, for 
the solution of which this difference must be taken into account. 

Below, unless otherwise stated, we shall assume the initial 
set of numbers {&,;} to be uniformly distributed over the interval 
(0,1). 


2. SIMULATION OF INDEPENDENT RANDOM EVENTS 


Suppose we have to simulate a random event A which occurs 
with given probability p. Let § be a random quantity having a 
uniform distribution over the interval (0,1). The values assumed 
by this quantity are denoted, as above, by R;. 

We shall identify the event A with the event ‘‘a selected value 
R, of the random quantity satisfies the inequality 


Ri<p.” (7.12) 
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It can easily be seen that the probability of the event 4 is 
equal to 


Pp 
P(ay=f dx =p. (7.13) 
0 


Then the opposite event A is the event 


R,> p 


and its probability is equal to P(A)=1—p. 

A procedure for simulating trials of the type considered con- 
sists in selecting values R; and comparing them with the quantity 
p. If in a given comparison the condition (7.12) is satisfied, then 
the outcome of the trialis the event A. If the condition (7.13) is 
not satisfied, the outcome of the test is the event A. 

The above considerations can be extended to a group of 
events, 


Let .A,, Ag, .-., A, be an exhaustive set of events occurring 
with probabilities p,, p,,..., p,. As is well known, we have in 
this case 


Ptht .-- +p=!l. 


We shall identify the event A, with the event ‘‘the selected 
value R, of the random quantity satisfies the inequality 


> 


Ign SRySly (7.14) 
where 
Ld * 
= D Py (7.15) 


Similarly to (7.13) we can write 


i 


m 

P(A,)= f dx= py: 

In-1 

The trial simulation procedure consists in this case in 

sequentially comparing random numbers R, with the quantities /,. 

The outcome of a trial proves to be the event 4,, if the condition 
(7,14) is satisfied. 

This procedure is sometimes called the determination by lot 

of the outcome of a trial in which the possible results have prob- 

abilities p,, po,.... Ps It is often necessary to examine trials of 
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this kind, each of which is, however, a composite event depending 
on two or more simple events. 

For example, let independent events A and B have probabilities 
pa and p, respectively. Possible outcomes of simultaneous trials 
will be, in this case, the events 


AB, AB, AB, AB (7.16) 
with probabilities 
PaPa» (1—Pa)Pa Pa(l—Pg) (L—pa)—pzg). (7.17) 


It is evident that to simulate simultaneous trials, two versions of 
the procedure can be used. The first of them consists in sequen- 
tially testing a condition similar to (7.12) with respect to the 
events A and 8. However, we can also proceed differently. A 
second version may be constructed by analogy with (7.14) as the 
determination by lot of one of the outcomes (7.16) in accordance 
with the probabilities (7.17). 

The first of the versions considered requires the use of two 
numbers &; and two comparisons to test conditions (7.12). In the 
second version we can restrict ourselves to a single number R,, 
but a greater number of comparisons may in general be required. 
In the practical solution of problems the choice of one version of 
the procedure or the other is determined by consideration of the 
ease with which the algorithm may be constructed as well as 
economy in the number of machine operations required and in the 
number of locations of the working memory. On the whole the 
first version proves more economical than the second one. 

Let us consider the case when the events A and B are not 
independent. As before, let the probabilities of the events A and 
B be denoted by p, and pg. In addition we assume that we are 
given the probability p(B/A) of the event B on the assumption that 
the event A has occurred. 

The first version of the above-mentioned procedure will have, 
in this case, the following setup. 

From the set {R,} there is extracted a number R, and the 
validity of the inequality 


Ra< Pa (7.18) 


is tested. 

If the inequality (7.18) is found to be valid, this means that the 
event A has occurred. Therefore for the test connected with the 
event B we use the probability p(B/A). From the set {R;} we take 
the next number R,,,, and test the condition 


Rasi< p (BIA). (7.19) 
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Depending on whether the inequality (7.19) is valid or not, the 
outcome of the test is either AB or AB. 

If the inequality (7.18) is not found to be valid, this means that 
the event A has occurred. Therefore, for the test connected with 
the event B we must use the probability p(8/A), This probability 
can be determined from the theorem of total probability 


p(B) = p(A) p(B/A)+ p(A) p(8/A), 


from which it follows that 


Pg—PaP (BA) 


p (B/A) = ~8 roa (7.20) 


We select from the set {R,} the number &,.,; and test the validity 
of the inequality 


Rasi <P (BlA)- 


Depending on whether this inequality is valid or not, we obtain 
for the test the results AB or AB. 

We can also use the second version of simulating procedure, 
To do this it suffices to note that the events 


AB, AB, AB, AB 
form an exhaustive set and have probabilities 


Pa> P(B/A), Pall— p(B/A)], ty 
(1— py) p(B/A), (1 pall — p(B/A)], 


where p(B/A) is determined by the relation (7.20). 

More complex algorithms can also be constructed ina similar 
manner, We shall restrict ourselves to showing the principle of 
simulation of simple Markov chains. 

A simple homogeneous Markov chain is determined by a 


transition matrix 
Pu Pig «+> Piz 


Possible results of trials are the events A,, A,,..., A,. The 
probability p,,; is the conditional probability of occurrence of the 
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event 4, in a trial, given that the result of the previous trial was 
the event A;. 

The simulation of such a Markov chain consists in sequentially 
selecting events A; by lot according to probabilities p,;. It pro- 
ceeds as follows. 

We choose first an initial state assigned by the initial prob- 
abilities py), Pon... ++ Poge 

To do this we select from the set {R,| anumber R, and compare it 
with the quantities /, as in(7.14), where the quantities pp), pop. ---+ Pos 
are used as the p,’s in (7.15). 

In this way we determine an integer m, which satisfies the in- 
equality lm, < Ry</mot1. 

Thus the initial event of the realization of the chain will be 
the event An,. Then we select the next random number R,,,, which 
is also compared with quantities be Here, however, we use, as the 
probabilities p, for determining /;, the elements Pag? Pm *** Prue 
of the transition matrix, By means of the comparison, the num- 
ber m, for which the condition tested is verified is established, 
and the next event of the realization of the chain will be the 
event Am, We continue in a similar manner. It is evident that 
each number m, determines not only the next event Am, of the 
realization but also the distribution of the probabilities 
Pra? Par 0st Prop for selecting the next integer m,,.. 


We observe that for ergodic Markov chains the effect of the 
initial probabilities decreases rapidly with the increase of the 
ordinal number of the test. Therefore we can take, for the quan- 
tities pp. Poo» +++ Pox, aYbitrary values, for example 


1 
Por = Poa = +++ =P, ZF 


The key idea of the procedure described is retained also for 
more complicated Markov chains, for example, for nonhomo- 
geneous chains. 


3. TYPICAL FEATURES OF THE SIMULATION 
OF EVENTS BY MEANS OF RANDOM 
NUMBERS WITH FEW DIGITS 


The rules considered above are strictly valid only in the case 
when random numbers &; having uniform distribution over the 
interval (0,1) are used to simulate trials. 

However, when electronic digital computers are used, only 
random numbers 2 with quasi-uniform distribution are available. 
This case has typical features which it is desirable to discuss 
here, if only briefly. 


252 VII. TRANSFORMATION OF RANDOM NUMBERS 


Let us consider first the determination of results of trials by 
verifying the validity of condition (7.12). 

Suppose we have available k-digit numbers with possible 
values 


i 


wo | GH 01,2... HD). (7.21) 


Ri = 

We now replace R; in the inequality (7,12) by the number Rj. 
Thus, the simulated event A* is identified with the event 

Ri<p. (7.22) 

The probability P(A*) can be found as the ratio of the number na of 

numbers of the form (7.21) smaller than or equal to p, to the 

number N of all numbers of the form (7.21). As is well known, 

N=2', Thus 
1 2 
P(A’) => (7.23) 


We can draw from the relation (7.23) the following conclusion: 
if the probability p of the event A is comprised within the limits 


: (7.24) 
then P(A) =—, 

It follows’ from this that the use of the numbers &} instead of 
R; leads to an error in the value of the probability of the event, 
equal to eee p=Ap, Clearly the maximum value of the error Ap 
does not exceed => r 

A similar situation arises also in the more complex case in 
which results of trials are determined by verifying the validity of 
conditions (7.14). 


4, METHODS OF OBTAINING RANDOM 
NUMBERS WITH ASSIGNED 
DISTRIBUTION LAW 


There exists a basic relation that connects random numbers 
S; having an assigned distribution law and numbers &; having 
uniform distribution over the interval (0,1). 


| 
| 
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The possibility of obtaining such a relation derives from the 
following theorem (see, for example, [33]): if the random quantity 
= has a probability density f(x),then the distribution of the random 
quantity 


€ 
n= f f(x)ax (7.25) 


is uniform over the interval (0,1). 

On the basis of this theorem we can establish the following 
rule. In order to obtain a number belonging to the set of random 
numbers {5S,} having the probability density function f(x), we must 
solve with respect to S, the equation 

8; 
f f (x) dx = R;. (7.26) 


-—o 


To prove this rule, we consider a random quantity 7 having 
uniform distribution over the interval (0,1), and a random quan- 
tity § connected with 7 by the relation (7.25). 

We assume that f(x) nowhere reduces identically to zero. 
Then, accordingly to (7.25), 4 is a monotonically increasing func~ 
tion of §, and therefore § can be expressed as a single-valued 
function of 7 

5= $(%)- (7.27) 


It can easily be seen that the inverse function 
n=9'() 
is expressed in this case by the relation (7.25), By bearing this 
in mind we shall find the probability density of the random quan- 
tity &. The distribution function F;(x) is equal to the probability 
thatt< x: 
F;(x) =P < x). (7.28) 

We substitute for ¢ in (7.28) its value from (7.27) 

Fi (x) =P [9(m) < x]. (7.29) 


Since the function §=¢(7) is monotonically increasing, the in- 
equality 


a(n) <%* 
is equivalent to the inequality 
4 < om! (x). 
Soros Fi(x)=Pin< @7 (x). (7.30) 
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The probability occurring in (7.30) can be evaluated since the 
density function f,(y) of the random quantity 7 is known: we have 
in fact assumed that 7 is uniformly distributed over the interval 
(0,1). Therefore 


etx 


Feox)= f f,(yay 
tt) 


or 
pix 


Fe(x)= fo dy=e7(x). (7.31) 
0 


By substituting for ¢7!(x) in (7.31) its expression from (7.25) we 
obtain x 
Fr(x)= f f(x)dx. 


The latter relation shows that the random quantity has the density 
function f(x). 
In a number of cases, the relation (7.26) can be directly used 


for practical purposes. Let us consider some examples. 


Suppose it is required to obtain random numbers with an ex- 
ponential distribution law 


f (x) =he-** (x > 0). (7.32) 


By virtue cf relation (7.26) we obtain 
a 
he =o 
rN f e-*dx=R, 
or, after evaluating the integral, 
1—e"“i=R,. 
By solving this equation with respect to x,, we obtain 
x=—H+In (1— R). (7.33) 
If we have available random numbers A, uniformly distributed 
over the interval (0,1), then, by using formula (7.33), we can con- 


struct a sequence of random numbers x; having the exponential 
distribution (7.32). 
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Suppose it is required to obtain random numbers x, with 
distribution law 


r h 
f(x)=4(1—5 +) (0<x<z) (7.34) 
which finds application in the solution of certain problems of 
queueing theory. 
By using relation (7.26) we can obtain 
rN 
a(x, at a x)= R:. 
Hence 
2 ———— 
“= F(1-—V1I—R). 


As our next example, let us consider the probability density 
function 


1 
(=e (9<*< =): 


c 


Relation (7.26) has in this case the form 


Therefore 


- (7.35) 
a saa NT 


It may be shown in a similar manner that the quantities 
S; = oV— In R; V2 
are distributed according to the Rayleigh law with parameter sa: 
a 
F (x)= l1—e *, x>0 
0, x<0. 


Random numbers distributed according to this law play an 
important role in radio-engineering problems. 
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In certain problems we have to operate with random quantities 
7 distributed according to Pareto’s law 
1 
1——, x>1, 
pea: z > 
x<l. 


Such quantities are obtained from uniformly distributed pseudo- 
random numbers § by means of the formula 


NE 4, 


which is of particularly simple form if a=+,where kis an 
integer. 

We have considered above examples of transformation of 
random numbers into possible values of continuous random 
quantities, 

We consider now methods of simulating discrete random 
quantities, Suppose a discrete random quantity § assumes values 
x; with probabilities P(x—=-+x;)=p;. We call the occurrence of a 
possible value x; the event A,, If the number of x/s is finite, we 
can directly use the methods of the previous section. These 
methods are often applicable, however, also for an infinite set 
of values x,. 

Suppose, for example, that we need to obtain random numbers 
having Poisson’s distribution 


Paya se* (90,1, 2,...). (7.36) 


To do this we select random numbers &; and test the validity 
of inequalities of the type (7.14) 


ba-1 < R; ly» (7.37) 


where 
Lee S (01%...) Ly =O. (7.38) 
n=0 


If the inequalities (7.37) are found to be satisfied, the next ran- 
dom number 7 is assumed equal to ”. Clearly the quantity 7 has 
the Poisson distribution. 


_ 


| 
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Rules for the transformation of random numbers, based on 
relation (7,26), enable us to obtain a sequence of random num- 
bers x, that satisfy exactly an assigned distribution law if and 
only if the numbers RX, are uniformly distributed over the interval 
(0,1). 

In practice, instead of the numbers &,, we usually employ 
random numbers i, having quasi-uniform distribution. Owing to 
this the assigned distribution law is only approximately realized. 

If the number of digits of the numbers A; is equal to &, the 
transformation defined by the relation (7.26) operates on 2" dis~ 
tinct numbers &}, Therefore the number of distinct values x, can- 
not exceed 2*, If the numbers x, must be represented with & 
digits, then to represent all 2* numbers x; we may require in 
general a number of digits &*>k. This is explained both by the 
nonuniform disposition of the numbers x, on the numerical axis 
and by the fact that the numbers x;may exceed in absolute value 
the limits represented by #-digit numbers. 

Let us proceed to consider methods of transformation of 
random numbers not connected with the solution of equation 
(7.26). 

One of them consists in selecting, from a uniform set {R;,}, 
random numbers x, satisfying a certain condition in such a man~ 
ner that x, will be subject to the assigned distribution law. 

Suppose it is required to obtain random numbers x, having the 
probability density function /,(x). If the region of possible values 
of the random quantity § is not restricted on both sides, we have 
to pass to the corresponding truncated distribution. We assume 
that the region of possible values for the truncated distribution 
is the interval (a, 3). 

By the change of variables 


——— (7.39) 


b—a 


we pass to the random quantity 7 with probability density function 
f, (9) = (6 — 2) fela+(6— a) y}. (7.40) 


The random quantity 7» has the interval (0,1) as its region of 
possible values, Let the maximum value of /,(y) in this interval 
be equal to f,,. By a change of scale on the axis (0, f,(y)] we re- 
duce the interval (0, /,,). to a length equal to unity. Then 


AM= (7.41) 
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The resulting probability density function f*(y) may be in- 
scribed in a unit square as is shown in Fig. 42. 


Fig, 42 


Let us choose a pair of numbers 2-1, Ro; from a sequence 
of uniformly distributed quantities. This pair determines a ran- 
dom point in the unit square (Fig. 42). 

We shall take as values y, only those numbers R,, of the set 
{R,,} that satisfy the condition 


Ru < F(Roi- 1) (7.42) 
or 
Ry <=" [a+ (6 — a) Roi -1]. (7.43) 


There emerges thus the following procedure for generating a 
random quantity having probability density f; (x). 

From a set of random numbers R, uniformly distributed over 
the interval (0,1) we select pairs of numbers Ro-1, Ro. We test 
for the numbers Ry-;, R,; the validity of the inequality (7.43). If 
(7.43) is found to be satisfied, the next number x, is determined 
from the relation 


x; =a-+(b— a) R,;. (7.44) 


If the inequality (7.43) is not satisfied, the pair of numbers 
Ry-1» Ry is rejected and the following pair is selected. 
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It can be seen that the random numbers +, thus obtained have 
the distribution density /; (x). 

The methods given above for transforming random numbers 
require for their realization a considerable number of operations, 
This is especially true in those cases in which either Eq, (7.26) 
cannot be solved exactly or else the calculation of the right-hand 
side of the inequality (7.43) proves to be very laborious. In 
particular, none of these methods is applicable in practice for 
obtaining a sequence of random numbers with the Gaussian 
distribution. 

Let us consider methods of transforming random numbers 
which are based on approximate simulation of the conditions for 
which the limit theorems of probability theory are valid. 

Suppose it is required to obtain a set of random numbers {S;} 
having a Gaussian distribution with expectation a and standard 
deviation 5, 


hO= TE e-,” (7.45) 


By virtue of the central limit theorem of probability theory and 
under some very general conditions, the sums of a large number 
of random components are asymptotically distributed according 
to the Gaussian law. Therefore we can use the addition of num- 
bers of an initial set for approximately simulating normally dis- 
tributed random numbers, 

Since in this case the components have uniform distribution 
over the interval (0,1), we find from the limit theorem for 
identically distributed random quantities that, if the independent 
random quantities §,&....,%, have all one and the same prob- 
ability distribution and if each 2, has expectation a* and standard 
deviation o*, then the sum 


PSE tht ee Ey 
is asymptotically normal with expectation 
a=a‘n 
and standard deviation 
o=o* Vn. 
It can be shown that the sum £ has a distribution close to 


normal even for comparatively small values of n, In the solution 
of applied problems zis usually taken equal to 4—8. 
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As is well known, for a random quantity having uniform dis- 
tribution over the interval (0,1) the expectation is equal to1/2 


and the standard deviation is TE . Therefore the sum of n such 
quantities will have expectation 


= Z (7.46) 
and standard deviation 
=> = ° (7.47) 


Let us consider now what happens when, to obtain normally 
distributed random numbers by summation, we use quasi- 
uniform numbers Rj. 

It has been shown in Section 1 that random numbers having 
quasi-uniform distribution over the interval (0,1) are character- 
ized by expectation 1/2 and standard deviation 


tet yf Mtl 

2V3 2*&—1 

Therefore the expectation of the sum § is given by expression 
(7.46) and the standard deviation is 


n 241 
[== /= 1° (7.48) 

The need for relation (7.48) arises only when random numbers 
R; with few digits are used in the summation process. 

Of course, an increase of the number’ of components in the 
sum £ leads to the distribution law of § coinciding more closely 
with a normal distribution. However, this also leads to an in- 
crease of the number of operations needed for transforming 
random numbers. ; 

As has been shown in [7], special transformations may be used 


to improve the asymptotic normality of the sum &. 
For example, if 7 is the normalized sum 


n= = vi (7.49) 


i=l 


of random quantities § having uniform distribution over the 
interval (— h, +h), then the quantity 


E=1 ay 1-7) (7.50) 
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will have a distribution sufficiently close to normal for values 
of n substantially smaller than in the case of simple summation, 
The distribution law of the quantity = will be close to normal for 
n as small as 5 (see [7]). If another transformation is used, 
namely 


= 1 —aaphar (CP — 107+ 15%), (7.51) 


then it suffices for practical purposes to have n= 2, 

To use this type of transformation a larger number of opera- 
tions is required than in simple summation. Therefore, only by 
estimating the expenditure of machine time for solving a given 
class of problems can we assess the expediency of choosing a 
particular value of n for the number of components and using 
one transformation rather than another, 

Random normal numbers can be used to obtain functions of 
normally distributed quantities. As an example, we shall give 
the method of generating numbers distributed according to 


Rice’s law 
xt+s 


p(x)= te 3 J (sx), 


where /,(x) is the modified Bessel function of the first kind and 
zero order, 

The general method of obtaining numbers on the basis of 
inverting the distribution function is obviously inapplicable here. 
An indirect approach is basedonthe fact that Rice’s law describes 
the probability distribution of the absolute value (the modulus) of 
a random vector on a plane, with independent normally dis- 
tributed components having variance o? and nonzero expectation. 
To construct a concrete example, we can assume that the ex- 
pectation of the first component is equal to S and that of the 
second component is equal to zero, By denoting these normal 
quantities by & and &, we express the required Rice quantity y 
in terms of them: 

Another method of electronic computer generation of quanti- 
ties with Rice distribution is based on their physical meaning as 
the absolute value of the vector sum of two vectors: one constant 


of length s and one random having length ¢ distributed according 
to the Rayleigh law.* By considering the triangle formed by 


* The first vector simulates the voltage of a constant signal and the second vector 
a noise voltage, 
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these vectors and by denoting by » the angle between the com- 
ponent vectors, we have by the cosine theorem 


n= VEE — 2k cosy. 


A similar idea can be applied for obtaining random numbers 
having a Poisson distribution 


k 
P(k) =<, e-2 


with expectation a. 

Here we use Poisson’s limit theorem: if p,is the probability 
of occurrence of an event A in a single trial, the probability of 
occurrence of & events in # independent trials for n—>oo and 
P,—~0 is asymptotically equal to P(&), 

We choose a value of x large enough to ensure that 


Pa= a (7.52) 
is smaller than unity (it is usually required in practice that 
p,—=0-1—0.2). If we simulate a series of » independent trials in 
each of which the event A occurs with probability p,, then the 
random number x; having Poisson distribution must be chosen 
to be the number of cases of the actual occurrence of the event A. 

In conclusion, we shall consider a fairly general approximate 
method of transforming random numbers, based on piecewise 
approximation to the probability density. This method admits of 
a very simple realization on computers and in many cases 
ensures the required transformation accuracy. It has, therefore, 
wide acceptance in practice, 

Suppose we need to obtain random numbers with probability 
density f,(x) and a bounded region of possible values(c, d). If the 
range of possible values is not bounded, we pass to the corre- 
sponding truncated distribution. Let us split (c, d)into z intervals, 
and denote by a, the left-hand boundary of the Ath interval. 

We can adopt the following procedure for transforming ran- 
dom numbers: 

1) the random choice of an interval among the n possible 
intervals (the determination of the quantity a,), 

2) the random choice of the number y,, inthe interval (a,, a,.1) 

3) the generation of a random number x, from the relation 


X= Oy + Vai (7.53) 


From the point of view of realization on computers it is most 
convenient to choose the probabilities of occurrence in all 
intervals (a,, @,,,;) to be the same. 
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In this case we choose a number of intervals n large enough 
to ensure the required accuracy in the transformation of the 
random numbers, This number is conveniently chosen in such a 
manner that n=2”, where m is an integer smaller than or equal 
to the number of binary digits of the random numbers of the 
initial quasi-uniform set. 

In the simplest and most common case, the density function 
f;(x) is approximated within each interval by a constant value f,. 
This means that the random quantity 1, has a uniform distribution 
in the interval (a,, a,,). 

The quantities f, and a, are evaluated in advance and are 
placed in the memory of the computer before the beginning of the 
computations connected with the transformation of random num- 
bers. To calculate the values of a, and/, we can proceed in the 
following manner. If the number of intervals (Fig.43) into which 
the range (c, d) is divided is equal to 2, the probability of occur- 
rence of the random quantity £ in the interval (a,. 4,,,) must be 
‘equal to 1/n. Therefore 


Ret 


J feast. (7.54) 
if 3 


c Oy Aer d x 


Since a, is known and is equal to c the relation (7.54) enables us 
to evaluate one after another all the required values a,. Now, 
for each interval we can easily evaluate the values f, from the 
relation 1 

Fe* Gri — =F (7.55) 


The fact that the evaluation of the quantities a, sometimes 
involves lengthy computations is not very important, since these 
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calculations, being carried out in advance, do not affect the number 
of operations in the random-number transformation program. 

The random-number transformation procedure reduces to the 
following, From a set of quasi-uniform random numbers we 
extract a number A2;-; and we use the first m = log, n digits of this 
number as the address for sampling from a table of values ofa, 
and a,,,. Then we extract the next random number 2}, and 
determine x, from the relation 


%; = Oy + (Gg41 — 24) Roy. (7.56) 


This method for the approximate transformation of random 
numbers is very compact from the point of view of the number of 
‘operations required. It proves particularly convenient when we 
can choose a comparatively small number n (for example 16, 32 
or 64), We observe that the number of operations needed for the 
transformation of random numbers by means of this method does 
not depend on a, i.e., it does not depend on the accuracy of the 
approximation to the distribution law. The accuracy of the 
approximation determines only the size of the table containing 
the quantities a,. 

A disadvantage of the method is the fact that the accuracy of 
the approximation of the function f,(x) is not everywhere the 
same over the range (c, d). The accuracy depends on the quantity 
f, and decreases for small values of f,, Therefore we must 
choose the number of intervals 2 with a view to ensuring the 
desired accuracy over the interval with the smallest value of f,. 

The relation (7.53) enables us to obtain, on a computer, the 
value of a Rice-distributed quantity by means of two uniform 
quantities. One of them is needed for generating a Rayleigh 
distribution for ¢, and the other, to be later multiplied by 2r, is 
the phase displacement ¢ of noise with respect to signal. In the 
programming method of generating uniformly distributed quan- 
tities, the two methods described for generating Rice-distributed 
quantities require approximately the same expenditure of machine 
time, 


5. SIMULATION OF RANDOM VECTORS 
AND RANDOM FUNCTIONS 


In solving problems by the Monte Carlo method, the need often 
arises for generating samples of random vectors and in particular 
discrete realizations of stationary and nonstationary random 
processes, We shall consider in this connection certain methods 
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of random-number transformation which are convenient from the 
point of view of machine computation. 

Let a two-dimensional random vector (§, 7) be assigned by 
means of the joint density functionf(x, y). Having available a set 
of random numbers {R,} which have uniform distribution over the 
interval (0,1), we need to obtain a sequence of realizations of the 
random vector (. 7). 

We find the partial density function of the random quantity » 


f= ff wax. (7.57) 


—-oco 


From the set of random numbers [A,} we choose a number R,,_; 
and by one of the methods considered in the previous section we 
determine the number y, corresponding to its having distribution 
density f,)- 

Let us now consider the conditional distribution of the 
random quantity § 


F(x yf) 
REI) = 5. 7) (7,58) 


From the set of random numbers {R,} we choose the number 
Ro, and determine a number x,, corresponding to it, having 
conditional distribution density /;(x/y,). 

It can be seen that the sequence thus obtained of pairs of 
numbers (x;, y,;) has the joint density function f(x, y). 

Similar relations are obtained in the case of a three-dimen- 
sional vector. 

Let the joint density function f(x, y. 2) of the components 
& m4 and ¢ be assigned. If random numbers x,, y, z, have distri- 
butions 


+0 +0 
£@= f f fo y. 2) dx dy, 
ane 
y— [ fa&yz 7.59) 
Sf, (y/z,) : ra (z) dx, ( 


a f (x, Yi zi) 
fi (x/¥j, z)) FA (2) Ff, 2D ’ 
then the components of the random vector (:, 7, 6) will have the 
joint probability density f(x, y, z). The procedure for generating 
xX; ¥, 2 is similar to the two-dimensional case. 
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These methods of generating random vectors are, however, 
very laborious. If the integrals of the density functions (7.57) and 
(7.58) or (7.59) are expressible in closed form, relation (7.26) 
may be used for generating realizations. Otherwise recourse 
has to be had to approximate methods. The use of approximate 
methods is made difficult by the fact that in the simulating 
process we must take into account the dependence of the condi- 
tional probability densities on the values of the components 
actually obtained, 


6, THE SIMULATION OF CERTAIN 
MULTIDIMENSIONAL QUANTITIES 


Let us consider a method of simulating n-dimensional random 
vectors whose components are normally distributed. We denote 
by 1, %:-++. %, the components of the random vector. The joint 
probability density of these quantities is equal to 


1 -4 (x m-'x) 
P(Xjo Lg. voor *) = VaxD 2 ; 
where M is the covariance matrix,X =(x,, *, ..., x,)andD=det MQ, 
The key idea of the method of simulation to be described is that, 
having generated n independent normally-distributed quantities 
byfo. +6, With zero expectations and unit variances, we subject 
them to such a linear transformation A that the resulting quanti- 
ties 7.7)..-+» TN, Will have the preassigned covariance matrix 
M=—=||m,,||. We observe that the property of normality of a distri- 
bution is retained under linear transformation, We shall show 
now how to choose the linear transformation A in such a manner 
that it converts the quantities §,, &,...,§, into %, m,..-., 7, With 
the assigned covariance matrix. 

We assume the transformation matrix to be triangular: 


= yb 
Ny = 4q181 + Angie» 


Na = Gnthi + Gaaia tt +++ + Ganka 


Its elements a@,, will be found from the condition 


MQ; = May- 
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According to our assumptions we have 


1, k=], 


Mists =? =| 0, k+l, 


and therefore 
Mai = a}, Me = af, =m, 
1.e., 
a= Vy. 
Let us determine the elements of the second row of the matrix A: 


Miya = 44199) MEi + 21, Q2oMb bg == 214g) = tty» 


i.e., 
— _™i2 
aay Ym, ’ 
and then 
My == a2, M+ 24,,2ME,é, + a2.Me = a ote a2, = Mg, 
i.e., 
_ MogM11 — Mig 
on = my, 


By proceeding in a similar manner we can determine one after 
another all elements of the matrix A: after we find its first k 
rows the (k-+1)th row is evaluated by means of £-- 1 equations 
obtained from the conditions 


Mist Mery P= 1, oe he 
The choice of a transformation with a triangular matrix is 
advantageous in that, in the first place, such a matrix occupies 
only 2@+) locations in the memory of an electronic computer 


(and not n? as would be the case if it were not triangular) and, in 
the second place, the number of arithmetical operations required 
to transform the numbers %, ..., 4, is reduced, 
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By denoting by —=(&,,..., ,) a vector the components of 
which are independent normal pseudo-random numbers, and by 
1=(Ny .++» N,) a vector the components of which have an assigned 
covariance matrix M, we can briefly express the procedure for 
obtaining the vector 4 by means of the relation 


n= Ae 


We observe that the generation of a set of normal quantities 
with nonzero expectations a,,a,,...,a, is accomplished by means 
of the addition y,+ 4,, ..-1 %,+ 4, 

In problems of detection theory (see Chapter V) normally 
distributed quantities serve to simulate the components of 
voltage vectors, When the number of elements in the message is 
equal to n, 2n-dimensional distributions must be simulated. It is 
possible in some cases to restrict ourselves to two independent 
groups of quantities, each of which is n-dimensional and repre- 
sents a corresponding set of components. 

The voltage amplitudes are represented as the moduli of two~ 
dimensional normal vectors. Let us consider methods of simu- 
lating multidimensional amplitude distributions when there is 
correlation between the voltage vectors, 

We shall simulate quantities assigned by the density (5.12) 
corresponding to the case in which independent noise voltages 
are combined with a signal varying from signal element to signal 
element by a constant factor 


D(Xp veer XY Q= 
2 
n *z .'s 1 2) 2 
-—_—_ ame — —+ — 
T] «e - [ee ( 2D)? TT] 4 (2;504s. 
k=l 0 J=1 


(We assume the noise intensity to be the same in each of the 
elements and we take it as unity). 

The first of the simulation methods suggested reproduces 
essentially the derivation of this density given in Section 1, 
Chapter V: x Rice-distributed quantities 7,, 7,,...,., are generated, 
the &th of which has density 

r+ans? 


Pp(X) = xe 2 [yg (@pSx), 


where the parameter s, common to all quantities, is random and 
is distributed according to the Rayleigh law 


5 ae 
p()= Fre a 
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Such a procedure ensures that the process has the desired 
structure: the useful signal remains constant within a given 
sample and fluctuates from experiment to experiment; the 
successive noise voltages combined with it are independent. 

Another simulation method operates with the voltage-vector 
components: having obtained 2n normal quantities with zero 
expectation and unit variance, we distribute them in pairs 


Eis M3 bor Nes e+ +3 Sad Ne 


representing the components of successive noise-voltage vectors. 
Then we generate two more normal numbers s, and s,, again 
having zero expectation, but with variance equal to qg?. These 
numbers simulate the components of the vector representing the 
useful signal. By denoting by a, the factor that distinguishes the 
Signals in a given element from those in another element, we 
write the expressions for the amplitudes of signaland noise com- 
bined 


Pp V gt ag5))? + (Mp + 2p Sy)”: RW, fia ht: 


We observe that in the problem considered the set of 2n com- 
ponents of the signal-plus-noise voltages forms a multi- 
dimensional normal quantity with nondiagonal covariance matrix. 
A set having density (5.15) can be simulated by a similar pro- 
cedure. 

The quality of random numbers obtained on electronic com- 
puters can be assessed by means of a series of statistical 
criteria. Some such criteria for uniform quantities have been 
given in Chapter VI. We shall consider methods of checking the 
quality of models of multidimensional quantities, 

For normal vectors there is no need for a special investiga- 
tion of the goodness of fit of experimental with theoretical 
distributions. In fact, if the initial independent normal numbers 
(from which the required correlated normal numbers are gen- 
erated by means of a linear transformation) have undergone 
goodness-of-fit tests, then their linear combinations, being sums 
of independent quantities, are bound to satisfy these criteria. 
Attention must be devoted however to determining the covariance 
matrix of the realized normal vector, since even a weak correla- 
tion of the initial uniform pseudo-random numbers may appre- 
ciably affect the correlation of the components of this vector. 
The closeness of the theoretical to the sample covariance 
matrix is estimated on the basis of the usual statistical 
rules. Thus the simplest test of the quality of a model of a 
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multidimensional normal quantity consists in evaluating the 
sample second moments and comparing them with the assigned 
values. 

A more accurate method of testing a modelof a normal vector 
consists in evaluating the probability that the point corresponding 
to the components will be found within the ellipse defined by the 
inequality 

XMUX<C 


or, more explicitly, 
nr 


iy ty SEs 
where a,, are the elements of the matrix M™', the inverse of the 
covariance matrix. In terms of the y-distribution we can easily 
give an exact expression which is compared with the experi- 
mental value. The latter is determined in the following mamner: 
having determined the x components of the multidimensional 
quantity to be simulated, we evaluate 


n 
By XX js 
PF arin} 


and if the value of this quadratic form does not exceed C, we 
consider the experiment to be successful and add unity to the 
counter of the number » of successful outcomes. In the contrary 
case the experiment is considered unsuccessful. Having carried 
out such an experiment a sufficiently large number of timesN 
(of the order of several thousands), we find the required value of 
the relative frequency as */N. This method enables us to verify 
simultaneously both the normality of each component of a pseudo- 
random vector and the existence of the assigned correlation. Its 
disadvantage is that a large part of the computer memory is 


occupied by the matrix M =a) (ASE locations ) and by the 


triangular transformation matrix S$ (the same number of loca 
tions), and this sets a limit to the number of dimensions of the 
quantities which may be tested. 

Another, simpler method of testing the quality of a model of 
a normal vector —=(n,. ...,7,) is based on the fact that the quantity 


CaM eee 
is normal with mean value 0 (ifMu= ... —=My,= 0) and variance 


dD. = 2X mp |} me, |] = De. 


i,j 


| 
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This method is implemented by verifying, by the ¢, ~test, the 
goodness of fit of the empirical distribution of the quantity 7 with 
the normal distribution of parameters (0, YDt,). 

A more complicated check is that of the quality of models of 
multidimensional amplitude distributions that are not deter- 
mined uniquely by the second moments nor even by a finite set 
of quantities. Models of such distributions can be tested by 
evaluating the frequency with which each amplitude exceeds a 
number of levels, as well as the frequency with which various 
pairs of amplitudes exceed simultaneously assigned levels. The 
choice of these rather than other characteristics is due to the 
fact that they can be compared with known theoretical values: 
one-dimensional probabilities (assigned by the Rayleigh distri- 
bution) and two-dimensional probabilities (assigned by Rice’s 
two-dimensional amplitude density). The latter corresponds to 
the amplitudes r, and r, built up from the normal quantities 
bs Ss Ege Ey (Fi SHE, ri = +E) with covariance matrix 


of 0 ab 
0 oe —ba 
a —b o? Q ||" 

| 8 a 0 o? 


N= 


Let us write D=—detM. Then Rice’s joint amplitude density has 
the form 


rr. (+r a’ +b? 
3 exp | — sna BD Mast Pe ry >0 


0, ry: fy <0 or ry <0, rye <0, 


and the corresponding distribution. function 
aes 
P(x, y=f fee fo) ar, aro 
o 0 


can be easily tabulated by expanding the function /, in a power 
series. 

Joint distributions of three or more dependent amplitudes 
are considerably more complicated, and therefore it is not ex- 
pedient to calculate the frequency with which an assigned level 
is exceeded at the same time by two amplitudes, since the 
theoretical probabilities are not available. 
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The quality of pseudo-random numbers can also be investi- 
gated by simulating problems that admit of exact solution. An 
example is provided by the calculation of the detection char- 


acteristics with amplitude-square summation (see Section 2, 
Chapter V). 


Appendices* 
I, Table of Random Numbers 


86515 90795 66155 66434 56558 12332 94377 57802 
69186 03393 42505 99224 88955 53758 91641 18867 
41686 42163 85181 38967 33181 72664 53807 00607 
86522 47171 88059 89342 67248 09082 12311 90316 
72587 93000 89688 78416 27589 99528 14480 50961 


52452 42499 33346 83935 79130 90410 45420 177757 
76773 97526 27256 66447 25731 37525 16287 66181 
04825 82134 80317 75120 45904 75601 70492 10274 
87113 84778 45863 24520 19976 04925 07824 76044 
84754 57616 38132 64294 15218 49286 89571 42903 


75593 51435 73189 64448 31276 70795 33071 96929 
73244 61870 28709 38238 76208 76575 53163 58481 
23974 14783 17932 66686 64254 57598 26623 91730 
32373 05312 94590 22561 70177 03569 21302 17381 
59598 56774 08749 43448 28484 16325 62766 31466 


91682 12904 29142 65877 64517 31466 02555 52905 
87653 98088 75162 97496 59297 79636 74364 16796 
79429 66186 59157 95114 16021 30890 21656 93662 
85444 39453 67981 49687 36801 38666 50055 11244 
85739 44326 91641 40837 93030 03675 18788 91332 


84637 76154 14150 07876 41899 69207 66785 87225 
59575 32764 91090 66515 05498 51512 16107 52141 
81305 58846 69558 41675 88898 23775 30649 86545 
29835 35801 23472 22700 39976 21279 36694 85970 
32795 54313 39072 16809 22148 60102 18465 87650 


37837 12507 54594 30814 23277 99497 11037 63718 
58394 96952 12181 11641 83373 14726 23541 25774 
74543 46849 95714 70358 95873 94136 83991 77299 
77338 59570 29277 82041 06923 01795 77022 17443 
21157 50634 16432 44292 20030 38547 67134 95995 


*For explanations of Tables I and II see page 279, 
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46747 
65087 
31523 
31051 
82112 


16363 
48752 
21440 
08051 
25601 


46351 
10145 
22896 
58479 
04510 


32882 
86678 
45406 
66178 
00385 


32767 
92934 
17106 
34586 
62265 


27700 
70221 
28191 
65318 
78069 


41219 
10049 
34098 
06810 
81495 


48498 
61078 
13314 
83359 
92230 


15614 
17420 
70181 
78489 
45793 


91017 
72467 
77078 
77898 
33322 


98006 
24503 
04821 
11756 
93838 


44475 
06118 
03247 
94899 
04954 


45035 
57660 
13031 
40984 
99652 


63524 
49927 
84929 
12664 
49921 


99848 
75413 
25973 
01522 
67384 


84032 
17571 
64748 
93058 
23434 
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57723 
79559 
60580 
46105 
01251 


77362 
12333 
54895 
76323 
27768 


68132 
45377 
78192 
40940 
99861 


62628 
05562 
72727 
39761 
41398 


14305 
32300 
30146 
63908 
35244 


01708 
01538 
06817 
10543 
82444 


51620 
19045 
91865 
97114 
98286 


57014 
91916 
40943 
98270 
04513 


14233 
71028 
66259 
63541 
42918 


47792 
25578 
95743 
33133 
63779 


47103 
82639 
51085 
79909 
41097 


52985 
20241 
19163 
76018 
17221 


13280 
84302 
18590 
92255 
07823 


77312 
90184 
70649 
40249 
02888 


08823 
60702 
68605 
62039 
87008 


95782 
57831 
68585 
56043 
30181 


26300 
11105 
45475 
51391 
98647 


26455 
38302 
12820 
97236 
07284 


39792 
95363 
27249 
59279 
55904 


97184 
08118 
05251 
44756 
17577 


96298 
04683 
52577 
53697 
28474 


78805 
21571 
33028 
39158 
97849 


98062 
85072 
14518 
69126 
07131 


77992 
50342 
84037 
36371 
36046 


80126 
43860 
23232 
62321 
32045 


96339 
40607 
53855 
33847 
64203 


20469 
22033 
90973 
65440 
30897 


75243 
89133 
63010 
08177 
63470 


17838 
94781 
42083 
28837 
64579 


06778 
73597 
24007 
12202 
25016 


44178 
15636 
48420 
79248 
25669 


94837 
82640 
32216 
07726 
97874 


23963 
64747 
38489 
94557 
24312 


40125 
09174 
43806 
19881 
47631 


03412 
09662 
33421 
59937 
51144 


38108 
55420 
23874 
70297 
02495 


19095 
46083 
44904 
67279 
49830 


13283 
71368 
14863 
66946 
26861 


70712 
34982 
10643 
60948 
94275 


28824 
93678 
94790 
29952 
11872 


67058 
29415 
36452 
63413 
32913 


64329 
04703 
09473 
28066 
98591 


22394 
93227 
86581 
10832 
20490 


49598 
42586 
65161 
09954 
89134 


32573 
40014 
05413 
44882 
32783 


57336 
11117 
73577 
77806 
09521 


87692 
75562 
20715 
00820 
090985 


87884 
64517 
38819 
11523 
58211 


94963 
82419 
14586 
10750 
25165 


62146 
99230 
97630 
77456 
29391 


51950 
95186 
17230 
77916 
93622 


83563 
47322 
18874 
93884 
92928 


26722 
94624 
68792 
38521 
49780 


66828 
47312 
56294 
71891 
05688 


97125 
89284 
43549 
72526 
80713 


42281 
00226 
18022 
19104 
68251 


74314 
74419 
63828 
62790 
19981 


49899 
16830 
46940 
93441 
50490 


93994 
45099 
88108 
79187 
40720 


78753 
59891 
27427 
48419 
78887 


26759 
59833 
53333 
45357 
58043 


60150 
89132 
31388 
10917 
32624 


51625 
50453 
40586 
79772 
11007 


05177 
67052 
89101 
89645 
26253 
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Table I (Continued) 


33907 
83348 
24470 
19246 
01034 


85596 
01616 
04507 
34102 
32937 


05504 
56658 
47478 
39413 
00552 


56688 
04764 
55350 
18324 
13397 


81964 
82636 
12249 
60728 
47618 


39264 
83816 
36579 
71396 
56130 


08070 
74467 
81516 
76663 
01014 


68910 
12793 
09949 
04823 
81542 


99969 
57837 
26119 
06711 
13229 


28215 
09704 
09945 
39925 
61626 


72118 
30765 
81898 
91229 
59380 


75496 
56768 
64148 
07146 
89114 


39814 
14536 
45116 
48298 
27883 


64860 
21519 
46587 
41691 
09695 


52065 
80685 
51880 
33372 
23840 


21809 
80539 
52569 
86417 
98017 


59315 
44518 
17982 
69123 
18305 


23947 
01665 
33690 
83750 
26413 


54076 
18373 
71577 
40574 
53997 


53496 
51856 
55303 
56770 
49188 


15297 
86326 
07083 
12224 
59045 


81513 
37571 
76678 
68941 
01106 


47126 
80961 
41343 
97965 
43638 


24999 
96900 
15280 
93256 
22338 


52309 
20589 
10074 
94081 
04039 


90368 
67557 
39054 
57347 
65397 


13728 
40016 
58556 
04602 
27699 


84805 
89886 
67523 
60338 
52277 


91252 
82168 
47188 
07315 
71153 


39063 
23793 
59674 
90055 
10053 


73890 
30763 
66292 
32464 
37326 


86568 
29195 
21053 
74417 
24297 


93342 
33640 
05881 
73021 
30668 


49677 
90142 
51933 
31686 
53297 


13948 
77970 
01777 
61162 
35557 


09304 
23608 
70045 
14268 
26665 


99719 
59947 
33755 
37971 
74145 


12496 
14528 
55405 
94387 
64291 


40346 
37246 
02121 
63394 
87030 


77188 
76460 
75883 
25270 
96442 
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21435 
33858 
80362 
25354 
99537 


30608 
97074 
79347 
27094 
28931 


81906 
45800 
58752 
87186 
29701 


54066 
54650 
38043 
54342 
90439 


99399 
89263 
65621 
55539 
36200 


52088 
00820 
65813 
16156 
83350 


05698 
92137 
14564 
04344 
68210 


02130 
85299 
73989 
06569 
24748 
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09236 65687 99595 29732 02164 29355 67021 79078 
86058 55744 01628 60154 76141 33877 73149 52993 
98654 31271 93089 91432 73210 46338 40570 75134 
24230 02041 22592 73570 55838 19176 33471 37223 
83760 48336 89010 41179 15624 55978 91734 09731 


16070 98566 23294 48608 61162 18826 10094 87310 
65772 28976 54794 53750 68663 94619 16582 08993 
77384 59106 95769 94443 16357 99405 60130 16686 
78120 63979 39085 12164 32556 13427 29805 65147 
73325 04458 25636 89350 28926 23348 98963 00053 


26453 48826 50357 66819 85259 97684 70052 81361 
94451 80733 51978 99816 96777 69889 64682 91635 
14786 85391 28116 71989 51460 99376 65904 88517 
47904 97167 87158 36103 27949 51309 59285 52608 
39322 89151 73891 63876 99274 85497 15690 97412 


Il. Table of Normal Quantities 


0.2005 1.1922 -0.0077 0.0348 1.0423 ~1.8149 1.1803 0.0033 
1.1609 -0.6690 -1.5893 0.5816 1.8818 0.7390 -0.2736 1.0828 
0.5864 -0.9245 0.0904 1.5068 -1.1147 0.2776 0.1012 -1.3566 
0.1425 -0.2863 1.2809 0.4043 0.6379 -0.4428 -2.3006 -0.6446 
0.9516 -1.7708 2.8854 0.4686 1.4664 1.6852 -0.9690 -0.0831 


-0.5863 0.8574 -0.5557 0.8115 -0.2676 -1.2496 -1.2125 1.3876 | 
1.1572 0.9990 -0.1032 0.5405 -0.6022 0.0093 0.2119 -1.4647 
0.4428 -0.5564 -0.5098 -1.1929 -0.0572 -0.5061 -0.1557 -1.2384 
0.3924 1.7981 0.6141 -1.3596 1.4943 -0.4406 -0.2033 -0.1316 
0.8319 0.4270 -0.8888 0.4167 -0.8513 1.1054 1.2237 -0.7003 


0.9780 -0.7679 0.8960 0.5154 -0.7165 0.8563 -1.1630 1.8800 
0.4083 0.1172 0.4380 0.2153 0.2471 1.2222 -0.5332 0.6769 
0.2506 0.1451 -0.1072 1.5161 -0.1149 1.7168 -0.4254 -0.4328 
-0.6155 -1.6600 0.6499 -1.1385 -0.0786 0.0723 -2.1401 -0.5111 
2.1961 0.8373 0.8334 0.0836 0.5572 -0.8472 0.6057 -0.2271 


0.5955 -0.9567 -0.4525 1.7141 0.1166 -0.3035 -0.8648 -0.2762 
0.8878 1.4399 -0.1673 1.9888 1.3912 1.1409 0.1408 1.1105 
1.3596 0.6264 0.7770 -0.4714 -1.8767 -0.3315 -0.4552 0.5407 
0.3961 0.3172 1.3063 1.7748 -0.7831 1.1407 -1.2317 -1.5495 
1.3996 -0.5412 -0.3513 0.0959 -1.2982 -0.2092 1.4988 -0.8433 


1.0572 
-0.7934 
0.7095 
0.4189 


0.9204 
1.0469 
0.0373 
0.8325 
-1.0980 


0.8285 
0.5849 
1.2694 
-1.8795 
~1.4345 


-0.2127 

0.4394 
1.7181 
1.1704 
-1.4312 


-0.5767 
-1.7750 
0.6989 
1.4573 
-1.9278 


-0.0307 
1.9745 
-0.6722 
0.8308 
-0.9590 


0.3326 
0.6549 
1.2105 
0.5716 
-0.0708 


-1.1670 
-1.7540 
-0.1531 
0.0814 
1.0686 
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0.6103 0.2143 -1.7647 0.8872 1.2249 0.2244 1.3989 


0.0863 -0.5204 
1.4052 1.5732 
-0.8784 -0.4474 
-1.1017 -0.3966 


-1.1859 0.6551 
1.3231 -1.4720 
1.0704 0.6511 
0.3653 0.9094 

-0.5706 -0.9678 


-0.9246 0.2817 
-1.2114 1.9777 
~0.8690 -0.6200 
2.1241 -1.0324 
0.8420 -2.1421 


-0.6388 0.5343 
0.6535 -1.4222 
1.0547 1.1764 
2.3965 -0.1719 

-1.3493 1.2580 


-0.1932 -0.3199 
1.3836 0.5768 
-0.9137 0.8286 
-0.5137 -0.8159 

1.3322 1.7982 


0.2432 0.1295 
-0.9768 -0.6119 
-0.8153 0.3077 

2.5666 0.4702 
-0.0971 0.4413 


-0.7979 -1.5355 
0.1661 -2.4684 
2.3642 -1.1261 
0.2217 0.5378 
1.3550 -0.8212 


-0.2764 -0.9646 
1.4139 -0.6992 
1.3783 -0.0558 

-0.6202 -1.5163 
0.4729 0.0907 


0.1195 0.4177 
0.0606 2.8545 
-1.6972 1.7019 
0.2764 -0.8211 


-1.1844 -0.0327 
-1.5188 0.1818 
0.0283 1.3351 
-1.2804 -0.3766 
-0.2011 -2.6891 


-1.4527 -1.8206 
-0.7401 -2.1394 
0.8522 0.3989 
-0.5972 -0.5367 
0.3184 1.7609 


0.9895 -0.1026 
0.5028 -0.9808 
0.9064 -1.6218 
1.4232 0.6767 
1.5037 -0.2647 


-1.2330 1.1220 
0.8314 0.4712 
-0.6841 0.2309 
1.6976 1.6808 
-0.2411 0.0470 


0.9592 1.1758 
-2.3175 -1.5597 
-0.4988 -1.3960 

0.0564 0.2810 
-1.6344 0.5451 


-1.3690 2.0035 
-0.8117 -0.1016 
0.1402 0.4794 
0.1267 1.3062 
-0.3456 0.2366 


-0.4825 0.9841 
-0.0217 -0.8669 
0.5885 -1.1431 
-0.5086 0.3290 
0.6189 2.1538 


-1.3384 -1.1699 
0.1878 -0.0843 
-0.0541 0.7496 
-2.1006 0.3755 


2.6107 -1.5307 
-0.3886 -1.1073 
-0.0447 1.0698 

0.1126 -0.7246 

0.7349 -0.2401 


-0.9273 0.1009 
-0.1418 0,5652 
-1.6043 1.3361 
~1.1800 1.2464 
1.3321 -1.7729 


0.4739 -1.5747 
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0.2175 
-1.5144 
0.7661 
0.8885 
0.5803 


0.2634 
0.2446 
1.7287 
-0.6078 
2.1936 


0.4673 
-0.2171 
-0.3627 
-0.9108 
-0.7298 


0.2136 


0.7944 -0.1824 -1.2026 


0.5456 -0.2568 
0.1161 0.5588 
-1.2482 -0.3690 


-0.1292 -1.9742 
-1.8326 0.0974 
0.8760 -1.3196 
-0.5607 1.6192 
0.7516 -1.2719 


~0.2544 -0.3663 
0.1089 -0.8989 
-1.9540 0.1807 
0.0845 0.4265 
~0.4258 -0.5102 


0.8132 0.1999 
0.8073 -0.0469 
-0.8293 -0.1925 
0.7560 1.3532 
1.1349 0.2758 


0.9214 -0.9516 
1.3461 0.3107 
1.5575 -0.0893 
-2.6179 1.3521 
-1.5846 0.0904 


-0.0006 
-0.1572 
1.4047 


-0.2299 
0.0229 
0.8593 

-1.5036 

-1.1642 


1.8976 
0.4925 
0.5275 
0.7492 
0.3929 


-0.3996 
2.2602 
-1.5751 
0.5975 
0.0464 


0.2055 
-1.6222 
0.9620 
0.9347 
-0.8897 
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~1.1613 
0.6500 


1.6165 
0.0475 
-1.4437 
0.8907 
-1.1346 


1.4385 
0.2328 
0.2062 
-1.5345 
-0.4928 


-0,5841 
0.1518 
0.3180 
1.1551 
0.1820 


-0.4705 
0.9670 
-1.4820 
1.1444 
0.0293 


0.3489 
-0.7011 


0.9104 -0.0423 


1.4756 
1.3478 


-0.5155 
0.4917 
1.5164 

-0.4471 
0.9266 


1.1780 
-0.8993 
0.6425 
0.4918 
0.6104 
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-0.9519 2.4481 -1.2937 -0.3668 0.3879 -0.4519 -0.7694 1.4488 
~1.2364 -0.4280 -0.5502 -1.1491 0.0441 -0.1506 0.5166 0.4555 
0.2610 0.7267 1.0333 0.4426 -0.5912 -0.6600 -1.3500 -1.1604 


-2.1545 -0.1767 
0.8222 1.7626 


-0.7251 -0.9607 
-0.0944 0.6800 
0.9985 ~0.2006 
0.4311 0.4879 
-0.8556 -1.5648 


0.4879 -0.0615 
0.2764 0.5612 
0.5703 -0.7782 
0.2521 0.0286 
0.6253 -2.3459 


2.6767 1.7088 
-0.7753 -1.4071 
0.0787 -0.9492 
-0.1413 -1.9574 
-~0.7459 0.2789 


1.8856 -1.3087 
0.6316 0.9936 
0.2212 1.0139 
1.5919 -0.4227 
-1.4862 -1.5943 


1.2612 1.6360 
1.4512 -0.1605 
-0.6756 
0.4546 -0.7222 
2.3351 0.1138 


0.8359 0.6713 
0.8378 0.2625 
-0.7221 -0.6674 
0.2909 0.0715 
-0.4778 0.7206 


-1.6038 0.3676 
0.1293 0.3592 
0.1769 1.1089 

-0.7745 0.5588 

-0.7964 -1.0848 


-0.3837 0.2718 
-0.0396 -1.4831 


1.1737 1.9487 
0.2583 -1.3042 
1.5134 -0.1220 
-0.5611 -0.4382 
-1.4676 -0.5550 


3.3023 0.0828 
1.2642 0.0228 
-0.3347 0.6958 
-1.0374 -0.3167 
-0.6131 -0.7769 


-0.6897 0.4119 
-0.3088 -0.7625 
0.8690 -0.2502 
0.8299 0.0078 
-0.4731 -0.8643 


-0.4818 0.0037 
~0.3840 0.0104 
1.6695 0.6186 
-1.0081 -0.3190 
0.4494 1.6794 


1.1331 0.7388 
1.2953 0.4469 
0.0033 -0.8384 
~0.5631 0.4012 
-0.1341 1.3255 


0.4304 0.2434 
-0.6391 0.4374 
-0.6961 -0.4910 

1.3707 -1.6617 

0.7800 0.0541 


0.2777 -0.6000 
-0.6395 0.2762 
1.0447 1.6418 
-0.2296 


1.0682 
0.7821 


1.3683 
-0.5009 
-1.2408 
-0.0232 


0.7272 -1.9370 
0.7722 -0.1627 


0.1228 1.7110 
1.5905 0.2739 
0.9547 0.9419 
0.7202 -1.6899 


1.9783 -0.6002 -0.8623 


~0.4150 -0.3619 -1.1449 
-0.0113 -1.1525 0.8575 
-1.6643 -0.1400 -0.4758 
0.0337 -1.5653 -0.3404 


~0.8436 
0.7105 


0.0709 1.6279 
1.5529 -0.8765 


1.2727 -1.1864 0.5924 
-0.2786 -0.1053 0.9414 


0.3747 
1.1979 


~0.1947 
-1.8476 


0.4133 -0.0213 
0.6021 0.1031 


0.5400 0.2916 
1.0431 -0.7333 


1.0103 -0.1774 0.7496 
-1.4802 -1.0290 1.2259 


1.4924 


0.3306 -0.6443 


0.2482 -0.9710 -0.0743 
2.6422 -0.5301 -0.3351 
0.1191 -0.9857 0.0763 
-0.4609 -0.3180 -0.6076 


0.1626 


-0.4370 
0.4508 
0.9222 

-0.9242 


0.4069 0.4443 


1.9007 2.3921 
2.0110 1.0462 
0.1287 -0.3559 
0.4884 0.8732 


-0.9749 -0.3148 -1.7382 


-0.3385 -0.1781 0.0006 
0.2523 -0.4792 -1.9707 
1.3131 -0.4279 -0.2146 


0.7240 -0.7744 1.0531 -0.3648 


1.0418 -2.3557 -2.2113 -0.2154 -0.4277 
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0.4524 -0.3759 -1.1044 -0.7922 -0.3908 0.5027 -0.8777 0.8350 
1.3386 -0.9384 0.9778 0.5967 0.0569 -0.4893 -0.1772 -0.1316 
0.3578 -0.0375 ~-0.2984 -0.2797 -1.2492 0.4011 -2.9682 -0.3262 
-0.0892 1.5757 0.9775 -0.6173 2.0789 -0.5821 0.4185 -1.1778 
-0.7167 -1.9664 -2.8695 -0.7468 0.1990 -0,2504 -0.6488 -0.6664 


-0.9094 -0.3657 0.0288 -0.4141 1.7280 1.7529 1.2845 -2.5891 
-2.1042 -0.3644 -0.0820 0.1336 0.2047 0.6016 0.0180 1.6723 
1.2015 -0.4216 1.2343 0.7220 -1.0582 -1.6844 1.3734 2.2182 
-1.1590 1.3998 -0.7100 0.4390 0.3926 0.0069 -0.9308 0.4610 
-0.1055 ~0.6711 -0.5184 0.4196 -0.7284 0.3841 0.2318 0.4297 


-0.4567 1.5014 2.2306 0.1521 -1.3206 0.2484 -1.0647 1.3832 
0.8239 0.5671 -0.8463 -0.8796 -0.4575 0.8069 0.1745 0.8412 
1.6095 -1.4104 0.9356 -0.5948 -1.4605 0.3091 -0.7754 -0.3175 

1.8891 -2.4284 -0.3966 -0.0619 0.8885 0.2336 -0.6679 -0.9313 
0.2825 0.1674 -0.5899 -0.0527 -0.6281 1.8808 0.3292 -0.5599 


1.2549 1.8023 -0.4091 0.7098 -1.1160 0.3299 -1.3397 -0.4159 
0.6437 1.2778 0.8326 -0.5625 0.9745 0.9824 -0.0795 0.8991 
0.8561 -0.2334 2.1016 0.1617 -0.0937 0.7040 -2.1222 0.6033 

-0.7138 1.7661 0.3209 -1.2491 -0.1967 -0.8866 0.1061 -0.0075 

-0.3168 -0.2093 -0.0818 0.4804 0.6276 -0.5237 2.0092 -2.1167 


-0.5841 1.0182 -0.1987 1.0343 1.1264 1.7410 -1.2655 -0,5511 
-0.0420 -0.1988 1.3127 1.5628 2.3005 2.8135 0.4605 0.2354 
0.4347 0.4712 -0.6811 1.8966 -0.9681 1.8779 0.2103 0.9124 
0.0955 1.0883 1.0695 -1.6493 1.2889 -0.6178 0.9129 -0.1627 
-0.8127 0.2933 0.8815 0.9142 1.2547 1.6200 -0.1370 0.0929 


NOTES 


Random digits simulate the results of a sequence of identical 
experiments, in each of which a digit is extracted ‘‘by lot’’ from 
the set 0, 1, 2, 3, 4, 5, 6, 7, 8 and 9 (that is, the probability of 
occurrence of each of the digits is equalto 1/10). The experiments 
are independent of each other. 

The 5000 digits givenin TableI (page 273) are the first decimal 
digits of pseudo-random numbers {7} used on the ‘‘Strela’’ com- 
puter. A description and the results of tests for ‘‘randomness’”’ 
carried out on these numbers are found in the paper [57]. The 
digits are divided for the sake of convenience into groups of five. 

Normal quantities simulate a sequence of values ofa normally 
distributed (Gaussian) random quantity with mean 0 and variance 1. 
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The 1000 numbers given in Table II (page 276) have been cal- 
culated on a ‘‘Strela’’ computer from pseudo-random numbers 
{y} according to a method suggested in the paper [7] using the 
formula 


¢ = 0.016, (97 + &)s 


where 


Me 


=V06 Y Cu—Y 


1 


(each ¢ is calculated from five values of 1). 
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